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Abstract 

Ascorbic acid is a potent antioxidant that detoxifies reactive oxygen species 
when plants are exposed to unfavorable environnnental conditions. In addition 
to its antioxidant properties, ascorbic acid and its biosynthetic precursors fulfill 
a variety of other physiological and nnolecular functions. A nnutation in the 
ascorbic acid biosynthesis gene VTC1, which encodes GDP-nnannose 
pyrophosphorylase, results in conditional root growth inhibition in the presence 
of annnnoniunn. To isolate suppressors of vtc1-1, which is in the Arabidopsis 
Colunnbia-0 background, seeds of the nnutant were subjected to ethyl 
nnethanesulfonate nnutagenesis. A suppressor nnutant of vtc1-1 2, svt2, with 
wild-type levels of ascorbic acid and root growth sinnilar to the wild type in the 
presence of annnnoniunn was isolated. Interestingly, svt2 has Arabidopsis 
Landsberg erecta features, although svt2 is delayed in flowering and has an 
enlarged nnorphology. Moreover, the svt2 genotype shares sinnilarities with Ler 
polynnorphisnn nnarkers and sequences, despite the fact that the nnutant derived 
fronn nnutagenesis of Col-0 vtc1-1 seed. We provide evidence that svt2 is not an 
artifact of the experinnent, a contannination of Ler seed, or a result of 
outcrossing of the svt2 nnutant with Ler pollen. Instead, our results show that 
svt2 exhibits transgenerational genotypic and phenotypic instability, which is 
nnanifested in a fraction of svt2 progeny, producing revertants that have Col-like 
phenotypic and genotypic characteristics. Sonne of those Col-like revertants 
then revert back to svt2-\\ke plants in the subsequent generation. Our findings 
have innportant innplications for undiscovered phenonnena in transnnitting 
genetic infornnation in addition to the Mendelian laws of inheritance. Our results 
suggest that stress can trigger a genonne restoration nnechanisnn that could be 
advantageous for plants to survive environnnental changes for which the 
ancestral genes were better adapted. 
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Introduction 

L-ascorbic acid (AA, vitamin C) is an important antioxidant with- 
multiple functions in many species. It serves as a scavenger of 
reactive oxygen species generated under adverse environmental 
conditions. However, AA also influences flowering time and senes- 
cence^"\ pathogen disease resistance^ "^, the biosynthesis of various 
plant hormones^"^ and root development^"^ ^ This suggests that AA 
and some of its intermediates have functions in addition to its anti- 
oxidant properties. 

Ascorbic acid biosynthesis in plants occurs predominantly through 
the D-mannose/L-galactose pathway^-'^l Given the multifaceted func- 
tions of A A in plants, there is a need to advance our understanding 
of how plants regulate the biosynthesis and accumulation of AA. 
Arabidopsis thaliana mutants deficient in AA have provided impor- 
tant insights into the breadth of molecular and physiological functions 
of AA. One of the Arabidopsis mutants, vtcl-1, contains a defect in 
the AA biosynthetic enzyme GDP-mannose pyrophosphorylase. The 
mutant was originally generated by ethyl methanesulfonate (EMS) 
mutagenesis of Col-0 wild-type seed^^. The vtcl-1 mutant contains 
a point mutation in amino acid 22 that converts a conserved proline 
into a serine The VTCl gene has recently been shown to be a 
determinant of ammonium sensitivity in plants. In the presence of 
ammonium, vtcl-1 mutants exhibit strongly reduced root growth 
in comparison to the wild type, a phenomenon that is independent 
of AA deficiency ^"^^ To better understand the mechanism through 
which VTCl mediates conditional ammonium sensitivity, it is 
important to identify regulatory partners of VTCl. To accomplish 
this, we undertook a suppressor mutagenesis approach of vtcl-1 
homozygous mutant seed in the hope of identifying vtcl-1 suppres- 
sor mutants that could then be isolated and studied. 

One of the suppressor mutants isolated in the generation, svt2 
{suppressor of vtcl-1 2), contained wild-type AA levels and developed 
roots similar to the wild type in the presence of ammonium. However, 
while characterizing the mutant genotypically, we observed that it 
lost the original vtcl-1 mutation (i.e., svt2 contained the homozygous 
wild- type allele). Furthermore, we determined that svt2, although gen- 
erated through EMS mutagenesis of Col-0 vtcl-1 mutant seed, was 
phenotypically and genotypically similar to Ler. Intriguingly, a small 
percentage of svt2 M^ plants produced offspring that have phenotypic 
and genotypic similarities to Col in the M^ generation. Even more 
remarkably, a small percentage of Col-like revertants in the M^ gener- 
ation produced progeny that exhibited phenotypic and genotypic svt2 
characteristics again in the M^ generation. 

Phenotypic instability of Arabidopsis alleles affecting a disease 
resistance gene cluster has recently been reported^^'. In their work, Yi 
and Richards described that exposure to EMS or through the genera- 
tion of different F^ hybrids induced phenotypic instability in the bal 
and cprl mutant alleles. The authors later proposed that the high 
phenotypic instability is caused by a genetic mechanism^ ^. 

The presented study focuses on describing and characterizing the 
Arabidopsis svt2 suppressor mutant and its phenotypic and geno- 
typic behavior. After illustrating the phenotypic features of svt2, we 
investigate transgenerational changes in the phenome and genome 
of svt2 and provide evidence that svt2 is a true mutant and not the 
result of an experimental artifact or contamination. Finally, we 



discuss our experimental findings in respect to the vtcl-1 mutant 
background and other reports that previously described similar 
phenomena of genome instability and restoration, and we briefly 
speculate on possible mechanisms of phenome and genome insta- 
bility in svt2. 

Materials and methods 

Plant material and growth conditions 

Arabidopsis thaliana Col-0 wild type and the previously described 
vtcl-1 mutant^^ (in the Col-0 background) were kindly provided by 
Patricia Conklin (SUNY Cortland, NY, USA). Ler-0 wild-type seed 
were obtained from The Arabidopsis Biological Resource Center 
(www.arabidopsis.org). Plants were grown in Metromix 360 pot- 
ting soil at 23 °C at both day and night with a 16-hour photoperiod at 
160 jomol photons m'^ s'^ (fluorescent bulbs). 

For assessment of root growth, seed of the wild types and mutant 
lines were surface- sterilized (see below) and grown on basal full 
strength Ix Murashige and Skoog (MS) medium without vitamins 
(Cat.# MSPOl, Caisson Laboratories, Inc., North Logan, UT), 
containing 1% Phytoblend (Cat.# PTPOl, Caisson Laboratories) 
in omni trays (Fisher Scientific, Pittsburgh, PA) as described^ ^ 
Sucrose was omitted from the tissue culture medium. The pH of the 
medium was adjusted with KOH to 5.7. Trays were sealed with two 
layers of 3M micropore tape (Fisher Scientific), put in vertical ori- 
entation, and placed in the growth chamber under long days (16 h 
light, 8 h dark) at 23 °C day and night, and 160 |imol photons m'^ s'^ 
in a growth chamber (Percival Scientific, Inc., Perry, lA). Each plate 
contained wild-type and mutant seed. Primary root length was 
measured in seven-day-old seedlings using a ruler. 

To assess AA content in leaf tissue, seeds of wild type and mutants 
were randomly sown on MetroMix 360 soil (BEG supplies Co., 
Burton, OH) in the same flat under the growth conditions described 
above. When plants were three weeks old, whole rosettes were har- 
vested for the AA assay. 

Seed-surface sterilization 

Seeds were soaked for 1 min in 50% ethanol, followed by washing 
the seeds in 50% bleach plus 0.01% sodium dodecyl sulphate for 
6 min. Finally, seeds were rinsed six times with sterile water and 
stored in 0.1% sterile Phytoblend agar for 2 d at 4°C^^ 

Ethyl methanesulfonate mutagenesis 

Seeds of homozygous vtcl-1 Arabidopsis thaliana (Col-0 back- 
ground) were mutagenized with 0.2% ethyl methanesulfonate as 
described (Figure 1;^^). Approximately, 1200 M^ seed were stratified 
for 4 days at 4°C in 0. 1 % agar, sown on MetroMix soil and grown 
as above. Plants were screened for wild-type AA levels using the 
nitroblue tetrazolium assay^'^. Additional suppressor mutants were 
isolated by pooling seeds generated from M^ plants. Putative 
mutants were isolated and allowed to self -pollinate to obtain seed. 

Pollen grain analysis and microscopy 

Pollen was taken from 4.5-week-old flowering plants of Col-0 and 
Ler wild type and vtcl-1 and svt2 M^ mutants, mounted in glycerol, 
and photographed using bright field settings on a Nikon E800 micro- 
scope equipped with a CoolSNAP cf CCD camera (Photometries, 
Tuscon, AZ, USA). 
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Table 1 . Forward (F) and reverse (R) sequences of primers 
used in analyzing the VTC1 gene and for amplifying five Col/Ler 
Insertion/Deletion (InDel) polymorphisms. 



" Mo ®^ Ml seeds/plants 

^ - genotyping 

- quantify AA 

- flowering time 

M2 seeds/plants 

f- root length on 1x MS 
- transplant to soil 

M3 seeds/plants 



Figure 1. Isolation of svt2. To isolate vtc1-1 suppressor mutants, 
homozygous vtc1-1 seed (in the Col-0 genetic background) were 
exposed to chemical mutagenesis using ethyl methanesulfonate 
(EMS). Over 1000 mutagenized seed (M^) were planted on soil and 
screened for wild-type levels of ascorbic acid. The only mutant isolated 
in the generation containing recovered ascorbic acid levels was 
svt2. The mutant was allowed to self-fertilize and was characterized 
phenotypically and genotypically in subsequent generations. 



Genomic DNA isolation 

Genomic DNA was isolated from rosette leaves following a previously 
described protocoll In case of genomic DNA isolation from vtcl-1 
seeds, a small amount of dried seeds was crushed and the extraction 
procedure described previously ^ was followed. Primers for the VTCl 
gene and for the Insertion/Deletion (InDel) polymorphisms were 
designed using sequence data available on The Arabidopsis Informa- 
tion Resource (TAIR) database (www.arabidopsis.org). Polymerase 
chain reaction (PGR) was used to amplify fragments of the VTCl 
gene for sequencing and to assess InDel polymorphisms. Sequences 
of primers used for sequencing and InDel analysis are summarized in 
Table 1. PGR reactions were run on 1.0 % agarose gels stained with 
ethidium bromide. 

Gene copy analysis using qPGR 

Quantitative PGR reactions were set up to measure gene copy 
number using 2.5 pmole gene-specific primers, 300 ng of genomic 
DNA diluted in DNase/RNase free water, and iQ SYBR Green 
supermix (Bio-Rad, Hercules, GA, USA) for a total volume of 10 |aL. 
Reactions without template were used as negative controls. Each 
single copy reaction was set up in triplicate and run in a Bio-Rad 
iGycler for 40 cycles. Threshold cycles (G^) were calculated using iQ 
software (Bio-Rad). 

Primer efficiencies (E) were calculated using cDNAs synthesized 
from RNA isolated from Gol-0 plants as previously described^ ^ 
cDNA samples were serially diluted across three orders of magnitude. 
Serial dilutions were amplified in triplicate using the same protocol 
as for the copy number experiment. The G^s of each triplicate were 
averaged and plotted against the dilution factor. A linear trend was 
fitted to the data and the slope of this trend was used to calculate 
E for each primer with the formula: e=10^^^''^°p^\ 

Gopy number of VTCl (AT2G39770) was calculated using the 
formula: Reported Quantity (RQ) = 1/E^^ normalized to the RQ of 
a known single copy gene (PAD4, AT3G52430;2«'2i). VTCl RQ was 
calculated from the average VTCl RQ of three biological replicates 
per genotype and was normalized to the average RQ of PAD4 from 



Primer Name 




5'3' 




VTC1 G1 F 




AAA AAT TOG TTC TAG ATG GAT GOT 




R 




ATG GOT GTA AAT TGG AAG AGA T 




VTC1 G2 F 




GAA COG TTG TOT OTA AAA TA 




VTC1 G2 R 




CAA ATG CCA TAA TCT GTT CC 




VTC1G3f 




CAA TTT TGC TTA CTT CTC T 




VTC1 G3 R 




TGG ATG CAA CCG ACA CAA AAC AAT 




VTC1 G4F 




ACA TTT TTA GCA GCT GGT ATT GAG 




VTC1 G4 R 




AGG TAA GAA CTG GCA GAG TAA AG 




VTC1 G5F 




TGG CTT GAG ACC ATT GAG T 




VTC1 G5 R 




GAG GCT TCC CCA CCG TGA GAT TTG 




VTC1 G6F 




CAA GCT GGA AAT CAA AAT CAC T 




VTC1 G6 R 




GCG CTG CTG CAA TCT TAG G 




VTC1 G7 F 




ACA AAT CTC ACG GTG GGG AAG C 




VTC1G7R 




TGG TTA ATT TGG GAG GAG A 




VTC1 G8 F 




CAA GGG CTC TAT GCT ATG GTG 




VTC1 GSR 




GCG TTT TGA TTG ATG CTT ATT C 




VTC1 G9 F 




GCG TGT ATG TGG AGC ACT ATG AT 




VTC1 G9R 




GTG GAG GGA ACT TAA GGG TAT TTT 




InDel 1 450919 F 


ATG GGT TTG TAA TCT CTG TCC A 




InDel 1 450919 R 


TAT GCG TTC CCA AAT TTG TTA TCT C 




Indel 2 451470 


F 


GGA GAG CCA AAC TGC TAT TAG A 




Indel 2 451470 


R 


AAC CGC CTC CAT TTG CAC CTT ATG 




Indel 3 469762 


F 


GTG ACC GAG TTT TGC TTT GTT CAT 




Indel 3 469762 


R 


CTC GTT TCT TTT CTG GGG TTG TAG 




Indel 4 449053 


F 


GAA AGA AAG GAG GGA AAG ACA 




Indel 4 449053 


R 


GCG CAT GCG CAT ACA CTG A 




Indel 5 455100 


F 


ACT TGC TTA ATG GTT TCT TTG TA 




Indel 5 455100 


R 


GCG CAC TGG TAT TGG CTT AG 





three replicates of each respective genotype, all run in the same reac- 
tion plate. 

Sequencing analysis 

PGR products were purified using the Qiagen Miniprep Kit. Dye- 
terminator based DNA sequencing was performed at the Genomics 
Facility in the Department of Biology at West Virginia University. 
Sequence alignments were performed using the BioEdit program 
(http : //www. mbio . nc su . edu/bioedit/bioedit . html) . 

Ascorbic acid quantification 

To screen mutants, AA levels were analyzed qualitatively in small 
pieces of two- week-old rosette leaves using the nitroblue tetrazolium 
assay previously described' '\ The AA content was determined in whole 
rosettes of three- week-old plants using the iron reduction assay^. 

Statistical analysis 

Experiments were performed at least three times. Figures represent 
individual experiments. Data were expressed as mean values ± SE. 
P values were determined by Student's t test analysis. 
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Results 

Isolation of svt2 

Our laboratory is interested in understanding how the VTCl gene, 
which is essential for the biosynthesis of GDP-mannose and AA, is 
regulated. This would help deciphering the pleiotropic phenotypes 
displayed by vtcl-1, including its hypersensitivity to ammonium^"^^ 
We employed a gene suppressor analysis with the goal of iden- 
tifying novel genes that interact or regulate VTCl. Seed of the 
vtcl-1 mutant, which is in the Col-0 genetic backgrounds^, were 
subjected to chemical mutagenesis using EMS^l One thousand 
and one hundred mutagenized vtcl-1 seeds (M^ generation) were 
planted onto soil and screened for recovered (wild-type) leaf AA 
content using the qualitative nitroblue tetrazolium test^^^. One of 
the mutants exhibited wild-type AA levels in the generation. 
This mutant was named svt2 (suppressor of vtcl-1 2), isolated, 
and further characterized. The mutant was allowed to self-fertilize 
and seeds from the plant were collected (M^ generation) (Figure 1). 
Note that we isolated additional suppressor mutants by pooling 
seed and by screening for long roots on Ix Murashige and Skoog 
(MS) medium containing ammonium. Six suppressor mutants were 
identified among 2000 plants. seed were collected and screened 
for long roots again to test for segregation. progeny of one line 
had all long roots, whereas the other five lines segregated in a ratio of 
three plants producing long roots, and one plant having short roots. 
Figure 2 summarizes data of four of these suppressor mutants, with 
D3-4 homogenously producing long roots, whereas D3-3, D3-7, 
and D3-15 developed long and short roots in a 3:1 ratio. As is 
illustrated in Figure 2 A, these suppressor mutants developed roots 
that were significantly longer than those of the Col-0 wild type. 
Analysis of the total AA content revealed that the suppressor D3-4 
had an AA content comparable to the Col-0 wild type, whereas that 
of vtcl-1 was only approximately 40% of that of the wild type^^'^^ 



Finally, sequence analysis of these four suppressor mutants 
demonstrated a lack of the vtcl-1 mutation (Figure 2C). Except for 
the assessments described above, these suppressor mutants were 
not yet characterized further. 



Root lengths of Col-0 WT, vtc1-1, and D3 suppressor mutants 
(mm) 

5 Data Files 

http://dx.doi.org/10.6084/m9.figshare.103765 



Total ascorbic acid of Col-0 WT, vtcl-1, and D3 suppressor 
mutants (pmol gFW-1) 

1 Data File 

http://dx.doi.org/10.6084/m9.figshare.103766 



svt2 has similarities with the Lerphenotype, but has also 
phenotypic characteristics that are distinct from Ler 

The first observation we made when characterizing svt2 plants was 
that svt2 exhibited a phenotype reminiscent of the her ecotype with 
the characteristic round leaves and erect morphology when compared 
to Col (Figure 3A). However, svt2 also had features that were distinct 
fi-om the Ler phenotype, including overall enlarged vegetative and 
reproductive morphology (insets of rosettes and flowers in Figure 3 A). 
In addition, svt2 was strongly delayed in flowering compared to the 
Col-0 and Ler-0 wild types and the vtcl-1 mutant (Figure 3 A, 3B). 
Primary inflorescences in four- week-old plants were 1 .4-times sig- 
nificantly longer in the vtcl-1 mutant and approximately twice as 
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Figure 2. Phenotypic and genotypic characterization of additional vtc1-1 suppressor mutants. (A) Root length in seven-day-old 
seedlings grown on 1x MS. Bars represent means ± SE of 18-73 individuals. Since D3-4 homogenously produced long roots, all individuals 
were included in the calculations. As D3-3, D3-7, and D3-15 developed long and short roots in an approximate 3:1 ratio, only individual 
seedlings that produced long roots were included in the calculations. (B) Total ascorbic acid content per gram fresh weight in whole rosettes 
of three-week-old plants. Bars represent means ± SE of three (Col-0 and vtc1-1) or 24 individual replicates. ***P < 0.001 by Student's f-test 
indicates significant differences in comparison to the Col-0 wild type. (C) Sequences of the Col-0 wild type, the vtc1-1 mutant and four 
suppressor mutants. The arrow points to the vtc1-1 mutation, a conversion of cytosine to a thymine. 
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long in the Ler-0 wild type compared to the Col-0 wild type. In svt2 
mutant plants, however, buds of primary inflorescences only began 
to emerge when plants were four weeks old (Figure 3A, 3B). The 
flowering data are consistent with previous reports, with Ler-0 wild 
type entering the reproductive phase before Col-0 wild type. An 
early flowering phenotype of vtcl-1 has been reported previously^ 

The AA content in svt2 was similar to levels quantified in Col-0 and 
her-0 wild types, whereas vtcl-1 contained approximately 30% of 
the AA content as expected^"^'^^ (Figure 3C). Finally, we investigated 
whether svt2 also exhibits recovered root development in the pres- 
ence of ammonium by growing the four genotypes in full strength 
Ix MS medium. The vtcl-1 mutant is conditionally hypersensitive 
to ammonium^'^'^^ Figure 3D illustrates that root length in svt2 was 
the same as in Col-0 wild type, whereas root development was 
strongly inhibited in vtcl-1 as expected. 

The enlarged morphology of svt2 raises the question as to whether 
svt2 is polyploid. In order to test this, we assessed the size of pollen 
grains from the Col-0 and Ler-0 wild-types and vtcl-1 and svt2 
mutants. As is shown in Figure 4, pollen grains of the four geno- 
types are similar in size. In addition, using qPCR, we determined 
the number of VTCl gene copies in the four genotypes. Our results 
revealed that VTCl is present as a single copy gene in both the Col-0 
and Ler-0 wild types and in the vtcl-1 and svt2 mutants (Table 2). 
Although an extensive chromosome analysis has not yet been per- 
formed in svt2, our results suggest that the mutant does not contain 
additional sets of chromosomes. 



suggesting a her insertion polymorphism. Thus, these data imply 
that svt2 shares both phenotypic and genotypic similarities with her. 

We therefore assessed five additional Insertion/Deletion (InDel) 
polymorphisms randomly chosen across the five Arabidopsis chro- 
mosomes (Table 1) in svt2 compared to the Col-0 and Ler-0 wild 
types and sequenced the entire VTCl gene and the promoter region 
tested. Our data show that the PCR products generated for those 
five InDels using svt2 genomic DNA had the same electrophoretic 
mobility as those produced from L^r-0 genomic DNA (Figure 7). 
Moreover, sequence analysis of the VTCl gene and promoter region 
revealed that svt2 contained a 283 bp insertion in the VTCl promot- 
er (Figure 5C). The insertion is highlighted in gray in Figure S 1 . Note 
additional single nucleotide polymorphisms as indicated by upright 
arrows in Figure 5C and Figure SI. When we aligned the VTCl 
gene sequence obtained from svt2 with that of the vtcl-1 mutant, the 
VTCl Col-0 gene sequence deposited in the TAIR database, and the 
VTCl Ler GenBank sequence, the VTCl gene sequence in svt2 shared 
similarities with Ler (upright arrows in Figure 5C, Figure SI) and Col 
(arrows pointing down in Figure SI). However, note that there are 
sequences that are unique to svt2 and are not shared between Col, 
vtcl-1 or her (arrowheads in Figure SI). Finally, note the overlap 
in sequences between Col, vtcl-1, svt2 and Ler on the 5' end of the 
sequence flanking the insertion (at approximately base pair 1990); 
see left-facing horizontal black arrow in Figure S 1 compared to the 
sequence flanking the 3 ' end of the DNA sequence insertion (start- 
ing at base pair 2273); see right-facing horizontal black arrow in 
Figure S 1 . 



Taken together, based on the phenotypic observations, our data 
suggest that svt2 represents a novel vtcl-1 suppressor mutant with 
recovered AA content and root development. Next, we character- 
ized svt2 genotypically in order to determine whether svt2 repre- 
sents an intragenic or extragenic suppressor. 

sW2 shares genome similarity with Ler 

To determine whether svt2 represents an intragenic suppressor, i.e., to 
test whether the suppressor mutation is present within the VTCl gene, 
we designed nine overlapping PCR primer sets spanning the entire 
VTCl gene and approximately 500 bp of the promoter region directly 
upstream of the first base in the 5' UTR (Table 1, Figure 5A). PCR 
products were generated from genomic DNA extracted from Col-0 
and Ler-0 wild types, and vtcl-1 and svt2 mutants. In eight of the 
nine primer pairs covering the entire VTCl gene, the PCR products 
generated using svt2 genomic DNA had the same electrophoretic 
mobility as those generated using Col-0 wild-type genomic DNA. 
However, this was not the case for the first primer set. The GIF/ 
GIR primer set, used to amplify the VTCl promoter region, gener- 
ated a larger PCR product in svt2 than in the wild type (Figure 5B, 
Figure 6). The PCR product in the wild type was 567 bp, whereas 
that in svt2 had a size of approximately 850 bp, suggesting that 
svt2 contained an approximately 300 bp insertion in this region. 
We repeated the PCR analysis of the VTCl promoter region using 
the GIF-GIR and the G1F-G2R primer sets that should generate a 
PCR product of 567 bp and 751 bp, respectively (Figure 5 A). The 
expected size was obtained for the Col-0 wild type and the vtcl-1 
mutant. However, approximately 300 bp larger PCR products were 
detected in the svt2 mutant and the L^r-0 wild type (Figure 5B), 



Finally, most intragenic suppressor mutants still contain the original 
mutation in addition to the suppressor mutation. Therefore, we expected 
that the vtcl-1 mutation is still present in svt2. However, our sequencing 
analysis demonstrated that svt2 did not contain the vtcl-1 mutation any- 
more and that the mutation reverted back to the homozygous wild-type 
allele (Figure 5D; green shading in Figure SI). 

In summary, our data demonstrate that svt2 shares DNA sequence 
similarity with Col and L^r, but also contains DNA sequences that 
are unique to this mutant. This is particularly remarkable because 
svt2 was generated in the vtcl-1 Col-0 background. Also, svt2 did 
not contain the original vtcl-1 mutation anymore. Although our 
data already argue against svt2 being a result of an artifact of the 
experiment or a contamination with Ler, we analyzed subsequent 
svt2 generations and discovered additional characteristics that are 
unique to svt2. 



f ^ 
Quantitative PCR to verify tliat VTC1 is a single copy gene in 
Col-0 and Ler-0 wild types and vtc1-1 and svt2 mutants 

2 Data Files 

http://dx.doi.org/10.6084/m9.figshare.103771 

V J 

si/f2 exhibits phenotypic and genotypic instability 

Our initial observations revealed that approximately 10% of svt2 
plants displayed a Col-like phenotype. Therefore, we planted 
svt2 M^, M^, and progeny to check whether this result could be 
repeated and to determine segregation ratios (Table 3). Additionally, 
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Figure 3. Phenotypic characterization of svt2. (A) Flowering phenotype of four-week-old Col-0 wild type, the vtc1-1 and svt2 mutants 
and the Ler-0 wild type. Insets show rosette phenotypes of the four genotypes when plants were three weeks old and the flower phenotype 
of six-week-old plants, respectively. (B) Primary inflorescence length when plants were four weeks old. Bars represent means ± SE of eight 
individual replicates. (C) Total ascorbic acid content per gram fresh weight in whole rosettes of three-week-old plants. Bars represent means ± SE 
of three individual replicates. (D) Root length in seven-day-old seedlings grown on 1x MS. Bars represent means ± SE of 30-90 individuals. 
*P < 0.05, **P < 0.01 , ***P < 0.001 by Student's f-test indicate significant differences in comparison to Col-0 and Ler-0 wild type, respectively. 
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Primary inflorescence length of Col-0 WT, vtc1-1, svt2 (M1) and 
Ler-OWT (cm) 

1 Data File 

http://dx.d0i.0rg/l 0.6084/m9.f igshare.1 03767 



Total ascorbic acid of Col-0 WT, vtc1-1, svt2 (M1) and Ler-OWT 
(Ijmol gFW-1) 

3 Data Files 

http://dx.d0i.0rg/l 0.6084/m9.f igshare.1 03768 



Root lengths of Col-0 WT, vtc1-1, svt2 (M2) and Ler-OWT (mm) 

3 Data Files 

http://dx.d0i.0rg/l 0.6084/m9.f igshare.1 03769 







10 Mm 



Col-0 



vtc1-1 svt2 (M^) Ler-0 



Figure 4. Pollen phenotype of Col-0 wild type, the vtc1-1 and svt2 
mutants and the Ler-0 wild type. Light images were taken wlien 
plants were 4.5 weeks old. Scale bar represents 10 pm. 



Table 2. Quantitative PCR to verify that VTC1 is a single 
copy gene in Col-0 and Ler-0 wild types and vtc1-1 
and svf2 mutants. Quantitative PCR was performed as 
described in Materials and Methods. The PAD4 gene is 
a known single copy gene. Therefore, an RQ/RQ ratio of 
approximately 1 indicates that VTC1 is present in similar 
quantity as PAD4, and therefore a single-copy gene. 





W RQ/RQ 1 






VTC1/PAD4 




Col-OWT 


0.3796 




vtc1-1 


0.5843 


VTC1 


svt2 


0.5504 


E=1.8 


Ler-OWT 


0.6329 




Col-0 


0.3153 




vtc1-1 


0.5292 


VTC1 


svt2 


0.4946 


E=2 


Ler-0 WT 


0.5807 





we investigated whether svt2 progeny that were phenotypically 
Col-Uke revertants would produce svt2 (L^r-Uke) offspring in the 
next generation. 

As summarized in Table 3, revertants could only be detected when 
a relatively large population was planted. In the svt2 generation, 
only 1% of Col-like revertants were detected. In contrast, 8-10% of 
svt2 plants displayed a Col-like phenotype, whereas no revertants 
were detected in the svt2 generation. These Col-like revertants 
were isolated and seeds were collected from individual plants and 
the phenotype of the progeny in the generation was assessed in 
some examples. In most cases, reversion appeared to be stable, i.e., 
once svt2 plants reverted, displaying a Col-like phenotype in the M2 
generation, their progeny continued to appear as Col-like plants. 
This was the case for the progeny of the A8 and G7 plants listed 
in Table 3. However, out of 63 progeny from the Kl revertant plant, 
one reverted back to a svt2-\ikQ phenotype (Table 3), i.e., the Kl 
double revertant switched from svt2 phenotype in the generation 
to a Col-like phenotype in the M2 generation, and then reverted back 
to a 5v?2-like phenotype in the generation. Note that only a small 
number of progeny was planted. In a second experiment, the svt2 
Col Rl revertant produced 20 individuals displaying a 5v?2-like phe- 
notype (Table 3). This represents a larger reversion percentage than 
in the Kl double revertant (22.7% vs. 1.6%). This may be explained 
by the genotypic make-up of the Col-like reverted parents and will 
be presented in the next section. Figure 8 illustrates the phenotypic 
appearance of three examples of svt2 Col single revertants (Col Rl, 
Col R2, Kl Col R) and a svt2 Col svt2 double revertant (Kl 
Col R svt2 R). 

Next we tested whether a Col-like revertant phenotype correlated 
with a Col-like genotype. Likewise, we would expect that a svt2 
Col svt2 double revertant phenotype corresponds with svt2-\ik& 
genomic markers. To check this we isolated genomic DNA from 
Col-0 and wild types, svt2, vtcl-1 and revertant mutants, and 
PCR-amplified the five randomly selected InDel polymorphisms 
plus the InDel polymorphism in the VTCl promoter (Table 1). 
In all cases but the svt2 Col Rl revertant, the ^vr2-like rever- 
tant plants (labeled svt2 Col revertants 1 through 5) produced 
PCR products that where of the same electrophoretic mobility as 
the PCR products generated using Col-0 wild-type genomic DNA. 
In contrast, svt2 plants and svt2 plants that displayed an 
svt2 phenotype, gave rise to PCR products that were of the same 
electrophoretic mobility as those of the L^r wild type (Table 4, 
Figure 9). In addition, the double revertant plant Kl (labeled svt2 
M2 Kl Col R) was genotyped in both its and generations. 
The Kl plant produced InDel PCR products similar to those of the 
Col-0 wild type in the M2 generation. However, the M3 genera- 
tion that displayed svt2-\ik& morphology produced PCR products 
that were comparable to the InDel PCR products generated using 
L^r genomic DNA (Table 4). The svt2 Col Rl (highlighted 
in red in Table 4 is intriguing, because it appears to contain DNA 
that is similar to both Col and Ler genomic DNA. This suggests 
the presence of chimeric genome sectors, which may explain the 
higher percentage of Col-like revertants compared to svt2 Kl 
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Ler-O 
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^ ^ ^ ^ ^ ^. ... ^. ^ ^ ^ ^ ^ . AC AT AAT AC C AT C C C T T T AAAAAC AC AGAAT T T T T T T T AT C AT C T C T GAA 
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Figure 5. Genotypic characterization of svt2. (A) VTC1 Col-0 gene model. Light green box indicates VTC1 gene promoter region, light blue 
rectangles indicate 5' and 3' UTRs, dark blue rectangles indicate exons, and lines indicate introns. Shown is the location of the i/fc^"/ mutation within 
the first exon, primer locations, and polymorphism insertion of 283 bp in Ler-0 VTC1. (B) PGR amplification of the VTC1 promoter region in the Col-0 
wild type, vtc1-1 and s\/f2 mutants and Ler-0 wild type. (-) indicates negative control, no DNA. (C) Partial sequence alignment of the VTC1 promoter 
region from the TAIR database (Col-0), sequenced Col-0 wild type, vtc1-1 and sW2 mutants, sequenced Ler-0 wild type and the Ler-0 sequence 
obtained from GenBank. The alignment shows the sequence insertion in the svt2 mutant, the Ler-0 wild type and the GenBank sequence. Arrows 
indicate single nucleotide polymorphisms between the Ler-0 and Col-0 sequence. (D) Point mutation in vtc1-1, a conversion from a cytosine to a 
thymine. 



PCR amplification of tlie VTC1 promoter region in tlie Col-0 wild 
type, vtc1-1 and svt2 mutants 



1 Data File 

http://dx.d0i.0rg/l 0.6084/m9.f igshare.1 03770 
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Figure 6. Molecular characterization of svt2. Amplification of the VTC1 gene including -500 bp of the promoter region using a series of nine, 
overlapping primers (G1F-hR through G9F-hR) in both Col-0 wild type and svt2W\^ mutant genomic DNA. The last lane in each gel contained 
a negative control (water instead of DNA). Red arrows indicate the different sized PCR products using the same primer set. 
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Table 3. Summary of revertant data. The table summarizes the number of plants screened in each of three svt2 generations 
(M^, and Mg), screens of revertant progeny from Col-like revertants (A8, G7, K1), and the revertant progeny of a Ler-like 
line (K1 Col R svt2R). The percent reversion is shown in the last column. Although the number of progeny plants tested is 
relatively large, some lines did not give rise to revertant progeny. R denotes revertant. "indicates mutant plants that were also 
analyzed genotypically (see Table 4). 



Experiment 




Total # of plants 


# of phenotypic revertants 


% reversion 




svt2W\^ 


63 


0 


0 


1 


svt2W\^, 3 of 7 revertants tested further: 
svt2 AS Col R Mg 
svt2G7 Col R M3 
svt2K1 Col R M3* 
svt2K^ Col R svt2R M/ 


78 
64 
64 
63 
96 


7 (Col phenotype) 

0 

0 

1 (s\/f2 phenotype) 
0 


8.97 

0 

0 

1.58 
0 




svt2 M3 


96 


0 


0 




svt2W\^ 


96 


1 (Col phenotype) 


1.04 


2 


svt2M^, 2 of 5 revertants tested further: 
svt2 Col R1 M3* 
svt2 Col R4 M3* 


62 
88 
96 


5 (Col phenotype) 
20 {svt2 phenotype) 
0 


8.06 

22.73 

0 


3 


svt2 M2 


96 


10 (Col phenotype) 


10.42 



Col R. Note that the PCR results are in line with the sequencing 
analysis of the revertants. That is, Col-like revertants and 5v?2-like 
revertants share sequence similarity with Col-0 and Ler wild type, 
respectively (Figure S2). 

Taken together, these data suggest (i) transgenerational phenotypic 
and genotypic instability in svt2, and that (ii) svt2 offspring do 
not segregate in a Mendelian fashion. In an attempt to obtain first 
insights toward a mechanism that is causing this genotypic instability, 
we investigated whether transgenerational epigenetic inheritance 
could play a role. 



Genome instability in svt2 does not appear to be triggered 
by a transgenerational epigenetic mechanism 

To investigate whether genome instability is caused by transgenera- 
tional epigenetic inheritance in the svt2 mutant, we performed recip- 
rocal crosses between svt2 mutants and Col-0 wild-type plants. It is 
possible that through the EMS mutagenesis of vtcl-1 seeds, genes 
involved in the regulation of epigenetic alterations were altered, 
whereby their activity was affected. There is increasing evidence in 
both plants and animals that epigenetic marks are not always cleared 
between generations. Incomplete erasure at genes associated with 
a measurable phenotype results in unusual patterns of inheritance 
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Figure 8. Phenotype of svt2 revertants. Plants were three weeks 
old when photographs were taken. Top row represents controls, Col-0 
wild type, vtc1-1 and s\/f2 mutants, and Ler-0 wild type. Bottom row 
represents three Col-like revertants, svt2 Col R1 M3, svt2 Col R2 M3, 
svt2 K1 Col R M3, and a double revertant, svt2 K1 Col R svt2 R M^. 
R stands for revertant. 



Table 4. Summary of PCR-based molecular genotypes. With the 
exception of svt2 Col R1 M^, where Col and Ler markers and one 
heterozygous marker were found (highlighted in red), phenotype 
matched genotype. That is, a Col-like phenotype correlated with 
the presence of Col polymorphisms, while a Ler-like phenotype 
correlated with Ler polymorphisms. C, L, and H refer to Col, Ler, or 
heterozygous, respectively. R denotes revertant. n.d., not detected. 



from one generation to the next, termed transgenerational epigenetic 
inheritance^^'^\ Therefore, analysis of the progeny of the reciprocal 
crosses is expected to provide some first insights on the possibility of 
transgenerational epigenetic inheritance that is transmitted maternally. 
If this were the case, only progeny of crosses with a maternal svt2 
donor should have a svt2-\ikQ phenotype. To determine the genotypes 
of the progeny of the reciprocal crosses, we performed another 
InDel polymorphism assay as described above. In addition, progeny 
were also screened using the VTCl InDel promoter polymorphism. 
Table 5 contains a summary of the InDel screen for progeny from 
each reciprocal cross. In all but six of the progeny from the recip- 
rocal crosses, PCR products similar to those obtained using Col 
and her genomic DNA, respectively, were generated, suggesting 
that the of the reciprocal crosses were heterozygous. A similar 
result was obtained for the VTCl promoter polymorphism marker in 
all reciprocal crosses. Note, however, that for some polymorphisms 
and irrespective of whether svt2 or Col-0 served as female or male 
donor, respectively, PCR products comparable to those obtained 
using Ler-0 wild-type DNA were prevalent (highlighted in red in 
Table 5). This is surprising because heterozygosity was expected 
at all loci. This suggests that some parts of the genome were not 
inherited equally from both parents. Taken together, these results 
suggest that maternal epigenetic inheritance may not be the cause 
of genome instability in svt2. However, at some loci svt2-\ikQ alleles 
dominate over Col-0. 



Genotype 


InDel 1 
450919 


InDel 2 
451470 


InDel 3 
469762 


InDel 4 
449053 


InDel 5 
455100 


G1F + 
G2R VTC1 


Col-0 WT 


C 


C 


C 


C 


C 


C 


vtc1-1 


C 


C 


C 


C 


C 


C 


Ler-0 WT 


L 


L 


L 


L 


L 


L 


svt2M^ 


L 


L 


L 


L 


L 


L 


svt2 M2 


L 


L 


L 


L 


L 


L 


svt2 Col R1 M2 


C 


L 


0 


H 


C 


C 


svt2 Col R2 


C 


C 


C 


C 


C 


C 


svt2 Col R3 M2 


C 


C 


C 


C 


n.d. 


C 


svt2 Col R4 M2 


C 


C 


C 


C 


C 


C 


svt2 Col R5 M2 


C 


C 


C 


C 


C 


C 


svt2 K1 Col R 


C 


C 


C 


c 


C 


C 


svt2 K1 Col R 
svt2 R M3 


L 


L 


L 


L 


L 


L 




f \ 



Summary of PCR-based molecular genotypes 

1 Data File 

http://dx.d0i.0rg/l 0.6084/m9.figshare.1 03772 

V J 



Summary of PCR-based molecular genotypes 

1 Data File 

http://dx.d0i.0rg/l 0.6084/m9.f igshare.1 03773 



Discussion 

The svt2 mutant was initially identified as a putative suppressor 
of thQ AA-dQ^cicnt Arabidopsis mutant vtcl-1, as was evident in 
wild-type levels of AA (Figure 3C) and recovered root develop- 
ment in the presence of ammonium (Figure 3D). However, svt2 
manifests other interesting characteristics, including genotypic 
and phenotypic instability. These unique features could aid in our 
understanding of the complex mechanisms controlling genome 
instability and restoration. 

svt2 is a novel Arabidopsis mutant and not a result of an 
experimental artifact, seed contamination, or outcrossing 

Several lines of evidence support our findings that svt2 is a novel 
mutant. First, svt2 was the only suppressor mutant isolated among 
over 1000 EMS-mutagenized seeds to show unique phenotypic 
characteristics. Astonishingly, our genetic analysis revealed that both 
maternal and paternal alleles were affected in five randomly selected 
InDel polymporphism loci, the newly discovered InDel polympor- 
phism in the VTCl promoter, and additional SNPs (Figure 5B-D, 
Figure 6, Figure SI). These data demonstrate that svt2 has acquired 
new characteristics, presumably as a result of EMS mutagenesis, and 
that svt2 is neither Col nor Ler. These data also argue against svt2 
being an experimental or PCR artifact. 

Second, a number of data provide strong arguments against seed 
contamination. (1) With high reproducibility, descendents of the 
original svt2 mutant produce offspring revertants with Col-like 
features (Table 3, Table 4; Figure 8, Figure 9). (2) One of the 
Col-like revertants, svt2 Col Rl M^, exhibited heterozygosity at 
some of the InDels tested (Table 4). (3) One of those Col-like 
revertants, svt2 Kl Col R M3, produced progeny that reverted 
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Figure 9. Insertion/Deletion polymorphism analysis in svt2, Col-0, Ler-0, and revertants. PGR amplification of the Col/Ler VTC1 promoter 
polymorphism in svf2 plants and svf2revertant (R) plants, amplified with the VTC1 GIF and G2R primers. (-) indicates negative control, no DNA. 



Table 5. Reciprocal crosses between svt2 and Col-0 wild-type 
lines. Molecular analysis of the InDel polymorphism markers 
showed evidence of cryptic but persistent homozygosity 
irrespective of the direction of the sexual cross (highlighted in red). 
However, heterozygosity was expected at all loci. 



Female x Male 
crosses 


InDel 1 
450919 


InDel 2 
451470 


InDel 3 
469762 


InDel 4 
449053 


InDel 5 
455100 


G1F + G2R 
VTC1 


svt2 X Col-0 


1 


H 


H 


L 


H 


H 


H 


svt2 X Col-0 


2 


H 


H 


L 


H 


H 


H 


svt2 X Col-0 F^ 


3 


H 


H 


H 


H 


H 


H 


svt2 X Col-0 F^ 


4 


H 


H 


H 


H 


H 


H 


Col-0 X svt2f^ 


1 


H 


H 


H 


L 


H 


H 


Col-0 X svt2F^ 


2 


H 


H 


L 


H 


H 


H 


Col-0 X svt2f^ 


3 


H 


H 


H 


L 


H 


H 


Col-0 X svt2f^ 


4 


H 


H 


L 


H 


H 


H 



Repeated PCR reactions of reciprocal crosses between svt2 and 
Col-0 wild-type lines 

1 Data File 

http://dx.d0i.0rg/l 0.6084/m9.f igshare.1 03775 



Reciprocal crosses between svt2 and Col-0 wild-type lines 

1 Data File 

http://dx.d0i.0rg/l 0.6084/m9.f igshare.1 03776 



Repeated reciprocal crosses between svt2 and Col-0 wild-type 
lines 

1 Data File 

http://dx.doi.org/10.6084/m9.figshare.103777 



Reciprocal crosses between svt2 and Col-0 wild-type lines 

1 Data File 

http://dx.d0i.0rg/l 0.6084/m9.f igshare.1 03774 



back to svt2-\ikQ plants (Table 3, Table 4; Figure 8, Figure 9). 
(4) We were unable to obtain true F^ heterozygotes in all svt2/ 
Col-0 reciprocal crosses (Table 5). (5) Delayed flowering and 
enlarged morphology phenotypes argue against the fact that svt2 
is a result of a Ler-0 wild-type seed landing on the flat during the 
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initial planting of the vtcl-1 mutagenized population. There 
is the possibility of a L^r seed contamination of the vtcl-1 seed 
stock used for EMS mutagenesis. Although we have sequenced 
the vtcl-1 seed stock used for this experiment and confirmed that 
it is homozygous for the vtcl-1 mutation, one could argue that 
sequencing the seed stock may not be a sensitive enough method 
to rule out contamination with a few L^r seed. We performed 
many other experiments using this very same seed stock and never 
observed L^r-like plants among the vtcl population. However, 
arguments (1) through (4) above speak most compellingly against 
seed contamination. 

Third, the following experimental evidence argues against the 
possibility that svt2 was generated by cross pollination of vtcl-1 
mutant plants with L^r wild-type plants. (1) If svt2 were generated 
by Ler cross-pollination, the InDel polymorphism markers tested 
using svt2 genomic DNA should have indicated heterozygosity. 
This, however, was not the case (Table 4). (2) While svt2 shares 
phenotypic and genotypic characteristics with L^r and Col, it also 
has unique features (Figure 3 A, Figure SI). (3) svt2 exhibits pheno- 
typic and genotypic instability, causing the appearance of revertants 
with persistent reproducibility. (4) L^r plants were not grown in 
our growth chambers at the time of the mutagenesis experiment. 
Furthermore, svt2 was isolated by placing Aracons over the mutant 
plant to allow self-fertilization and seed production. 

Possible causes of genome instability in svt2 

Our results are indicative of genome instability in svt2. Genome 
instability may be a result of polyploidy^^. Polyploids can arise from 
genome duplication (autopolyploids) or interspecific hybridization 
(allopolyploids). Our data suggest that svt2 does not contain multiple 
sets of chromosomes, because VTCl occurs as a single copy gene 
in svt2 and vtcl-1 mutants as well as the Col-0 and wild-type 
controls (Table 2). Furthermore, extra DNA must be replicated with 
each cell division. Therefore, enlarged cell size is often associated 
with polyploids-''. The chemical mutagenesis of vtcl-1 seed could 
have resulted in mutations, which may have led to increased ploidy 
levels in one, two, or all three meristem layers, LI, L2, and L3. 
However, only mutations in the L2 layer, which gives rise to the 
reproductive organs, are inherited. Polyploidy in the L2 layer is 
reflected in pollen size. While svt2 has an overall enlarged mor- 
phology (Figure 3A), its pollen size is comparable to that of the 
other three genotypes (Figure 4). This suggests that svt2 anthers are 
not polyploid. Finally, allopolyploids often display a greater degree 
of heterozygosity-'', low fertility, and low embryonic viability ^^"^^ 
This, however, is not the case in svt2. The fact that svt2 is fertile and 
that its enlarged morphology is heritable from one generation to the 
next suggests that svt2 is neither a somatic nor a gametic polyploid. 
Thus, it is therefore unlikely that polyploidy in svt2 contributes 
to genome instability. This is supported by Ruffio-Chable and 
co-workers, who reported that between 5% and 21% of F^ hybrids 
in Brassica oleracea showed aberrant leaf phenotypes, despite normal 
ploidy levels^'^ 

Instead, we hypothesize that genome instability of svt2 was fur- 
ther aggravated by exposing the already instable genome of vtcl-1 
mutants to EMS. It has recently been shown that plants impaired 
in certain aspects of protection against reactive oxygen species 



have a higher incidence of spontaneous double-strand breaks^^\ The 
AA-deficient vtcl-1 mutant has a three-fold higher spontaneous 
homologous recombination frequency and has a higher incidence of 
double- strand breaks (see below). Similar results were reported for 
the Arabidopsis thaliana flavonoid-deficient mutants tt4 and tt5^^. 
One may speculate that through the high level of stress induced by 
EMS, a yet unknown mechanism of genome restoration was turned 
on. In fact, genome alterations in soybean and flax in response 
to environmental stress have been reported previously In the 
process of soybean cell culture, massive specific changes in numer- 
ous genome- wide loci were observed^ ^ It was suggested that this 
genetic variation is a consequence of specific recombinational 
events. Similarly, in flax a single-copy 5.7 kilobase DNA fragment 
that was not present in the parent line appeared in genotrophs in 
response to particular growth conditions'^. 

Possible mechanisms of genome restoration in svt2 

The experimental evidence described in this work raises the question 
as to what mechanism is responsible for the loss or reintroduction of 
genomic DNA sequences in the original svt2 mutant and its revertant 
offspring. Several mechanisms may be considered: activity of trans- 
posable elements, random mutations, unequal crossing over, gene 
conversion, double- strand breaks and recombination, and activity 
of an RNA cache. 

Transposons are DNA elements capable of moving around the 
genome; movement is often associated with chromosome breaks 
and formation of unstable mutations, which revert frequently but 
often give rise to new phenotypes. Movement of transposable ele- 
ments often occurs during meiosis and mitosis and is accelerated 
by genome damage'l These represent conditions that are present 
in svt2. However, transposons have a variety of molecular features 
that do not apply to svt2. Transposons exist as multiple copies in the 
genome. A blast search of the VTCl promoter insertion in svt2 did 
not return any other hits, indicating that the DNA sequence is not 
present in its entirety anywhere else in the genome. Additionally, 
transposon termini represent inverted repeats. This, however, is not 
the case in svt2 (Figure SI). A short, direct repeat of genomic DNA 
often flanks the transposon, leaving a "footprint". Our sequencing 
analysis of the VTCl promoter region in svt2 did not reveal any 
footprints, suggesting that transposon activity is not responsible for 
the insertion or loss of novel sequences in svt2 (Figure SI). 

Random mutations caused by EMS mutagenesis could have acti- 
vated an unknown mechanism in vtcl-1 seeds, giving rise to the 
phenome and genome instability in svt2. This may explain the 
novel SNPs we detected in svt2 that are distinct from the vtcl-1 
mutant and Col-0 and wild types (Figure SI). The disap- 

pearance of the vtcl-1 mutation in svt2 (Figure 5D, Figure SI) may 
also be explained by the introduction of a random mutation. However, 
it is possible that exposure of vtcl-1 seeds to EMS could have 
reversed the original vtcl-1 mutation to the wild- type sequence, 
as vtcl-1 was initially isolated in an EMS screen^\ Interestingly, 
Conklin and co-workers previously isolated two vtcl alleles, vtcl-1 
and vtcl -2, containing the exact same single cytosine to thymine 
point mutation at amino acid position 64 relative to the start codon, 
despite the fact that vtcl-1 and vtcl -2 mutants were isolated inde- 
pendently from different EMS -mutagenized pools ^\ The authors 
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suggested that a limited number of mutations are tolerable in the 
VTCl enzyme GDP-D-mannose pyrophosphorylase without 
causing embryo lethality. This is supported by the fact that sev- 
eral independently isolated cyt mutant alleles containing differ- 
ent amino acid mutations in VTCl are embryo lethal^^. To date, 
only the vtcl-P^ and hsnl mutations^ have been isolated and 
reportedly do not cause embryo lethality. This suggests some form 
of allelic constraint that has been reported in Arabidopsis previ- 
ously^^'"*^. Furthermore, in the EMS screen in which the svt2 mutant 
was isolated, several other vtcl-1 suppressor mutants with restored 
root development in the presence of ammonium were identified. 
Sequencing analysis revealed that in all of these mutants the vtcl-1 
mutation was restored to the wild-type allele, while the suppres- 
sor mutants neither exhibited a 5v?2-like phenotype nor did they 
produce revertants in the subsequent generation (Kempinski et ah , 
unpublished data). 

Exposure to EMS or y-radiation has been reported to induce high 
frequency phenotypic instability in the Arabidopsis disease resist- 
ance genes CPRl and BAL, which map to the RPP5 locus Yi and 
Richards reported destabilization of phenotypes in both the bal and 
cprl mutants in more than 10% of EMS-treated plants in the M^ 
generation. They also identified exceptions to simple Mendelian 
inheritance in the M^ generation. Phenotypic instability was also 
observed in bal x cprl F^ hybrids. The authors suggested that the 
high degree of phenotypic instability in bal and cprl mutants is 
due to the fact that the RPR5 locus can adopt different metastable 
genetic or epigenetic states, whose stability is highly susceptible to 
mutagenesis and pairing of different alleles. Yi and Richards later 
reported that the phenotypic instability of bal mutants is caused 
mainly by gene duplication and hypermutation of the SNCl gene^^. 

As observed in the cprl and bal mutants, we hypothesize that EMS 
treatment has destabilized the genome of svt2 by interrupting one or 
more mechanisms involved in genomic inheritance. A combination 
of unequal crossing over, gene conversion, double-strand breaks, 
DNA recombination, and/or the presence of an RNA cache template 
may explain the loss and reappearance of DNA sequences in svt2. 
Genome-wide non-Medelian inheritance of extra-genomic informa- 
tion in Arabidopsis was reported in the hothead Qith) Arabidopsis 
mutant^^. Self-fertilization of homozygous mutant plants resulted in 
approximately 10% hth revertants, which were hth/HTH heterozy- 
gous, suggesting that the HTH gene was altered in the progeny. 
However, the authors also detected rare homozygous revertants 
HTH/HTH embryos, which must have inherited one of their two 
wild-type HTH genes from the maternal parent and could not 
have been a result of outcrossing. Inheritable genome- wide high- 
frequency gene homozygosity in early generations in rice has also 
been reported^l Lolle et al. postulated that these genetic restora- 
tion events are the result of a template- directed process that utilizes 



an ancestral RNA- sequence cache^^. This hypothesis is supported 
by observations reported by Xu and co-workers Therefore, our 
genetic and phenotypic svt2 data, in conjunction with the observed 
higher occurrence of double- strand breaks and spontaneous 
homologous recombination frequency in vtcl-1, are in support of 
the RNA cache theory. Additional studies are needed to provide 
experimental support for this hypothesis. 

Conclusions 

We have isolated a novel Arabidopsis mutant that is capable of 
restoring genetic information that was not present in the chromo- 
somal genome of its parents. We suggest that this ancestral infor- 
mation is present in some cryptic form that is accessible under 
extreme stress conditions. Genome restoration could be advanta- 
geous to plants that encounter environmental changes for which 
ancestral genes were better adapted. However, the mechanisms 
responsible for triggering and executing genome restoration 
remain to be determined. Double strand breaks, DNA recombina- 
tion, and/or the activity of an RNA cache may be contributing 
factors. In the future, svt2 may serve as a model to study non- 
Mendelian inheritance and could provide insight into the evolution 
and diversification of Arabidopsis ecotypes. 
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Supplementary material 

1630 1640 1650 1660 1670 1680 

I I I I I I I I I I I I 

Col - 0_VTCl_TAIR_gDNA GGTAAGTCAGTTTTTTTTTTTGGCTAATAAAAACGGTAAAATCATGTTATTGATAAAAAA 
vtcl - l_VTCl_gDNA GGTAAGTCAGTTTTTTTTTTTGGCTAATAAAAACGGTAAAATCATGTTATTGATAAAAAA 
svt2_VTCl_gDNA ^ 
Ler- 0_VTCl_GenBank_gDNA 



1690 1700 1710 1720 1730 1740 

I I I I I I I I I I I I 

Col - 0_VTCl_TAIR_gDNA TTTGAAAACAGTAAAAGAAATATGACCATATTTGAAAATTTACCTAAAAACTATAATGAT 
vt c 1 - 1_VTC l_gDNA TTTGAAAACAGTAAAAGAAATATGACCATATTTGAAAATTTACCTAAAAACTATAATGAT 
svt2_VTCl_gDNA 

Ler- 0_VTCl_GenBank_gDNA 



1750 1760 1770 1780 1790 1800 

I I I I I I I I I I I I 

Col - 0_VTCl_TAIR_gDNA GTAATAGTTTGATTTACTTACATTTAACTAATTTTTAAGGTCTGTTGATCGAACTCGTTA 
vtcl - l_VTCl_gDNA GTAATAGTTTGATTTACTTACATTTAACTAATTTTTAAGGTCTGTTGATCGAACTCGTTA 
svt2_VTCl_gDNA — - 
Ler-0_VTCl_GenBank_gDNA TCT 



1810 1820 1830 1840 1850 1860 

I I I I I I I I I I I I 

Col - 0_VTCl_TAIR_gDNA CCTTGAACAATTAGTAGGACAACTCAAACCACTAAACCATTATACTTTAAGGATTTATGT 
vtcl - l_VTCl_gDNA CCTTGAACAATTAGTAGGACAACTCAAACCACTAAACCATTATACTTTAAGGATTTATGT 
svt2_VTCl_gDNA 

Ler- 0_VTCl_GenBank_gDNA 



1870 1880 1890 1900 1910 1920 

I I I I I I I I I I I I 

Col - 0_VTCl_TAIR_gDNA AAAATTTCAAATATATATAGTTTAGTAGATGCACTTATCATCACACTCACCAATTGGATG 
vtcl - l_VTCl_gDNA AAAATTTCAAATATATATAGTTTAGTAGATGCACTTATCATCACACTCACCAATTGGATG 
svt2_VTCl_gDNA 

Ler- 0_VTCl_GenBank_gDNA 



1930 1940 1950 1960 1970 1980 

I I I I I I I I I I I I 

Col - 0_VTCl_TAIR_gDNA TCAACACCTGGTTCTAGCTTTTTAATTACCAAAGTGAAAAAACTGACTTTTTCTAAAAAA 

vtcl - l_VTCl_gDNA TCAACACCTGGTTCTAGCTTTTTAATTACCAAAGTGAAAAAACTGACTTTTTCTAAAAAA 

svt2_VTCl_gDNA 

Ler- 0_VTCl_GenBank_gDNA 



1990 2000 2010 2020 2030 2040 

I I I I I I I I I I I I 

Col - 0_VTCl_TAIR_gDNA AATTCGTTCTAGATGGATGCTCTTCAAATTCGTTC TAAAAAAAACTG 

vtcl - l_VTCl_gDNA AATTCGTTCTAGATGGATGCTCTTCAAATTCGTTC TAAAAAAAACTG 
svt2_VTCl_gDNA — TCTAGATGGATGCTCTTTAAAT C TTCGATATTTTTTATCCGTTTCGATAAT 

Ler-0_VTCl_GenBank_gDNA TCTAGATGGATGCTCTTTAAAT C TTCGATATTTTTTATCCGTTTCGATAAT 

2050 2060 2070 2080 2090 2100 

I I I I I I I I I I I I 

Col - 0_VTCl_TAIR_gDNA 

vtcl - l_VTCl_gDNA 

svt2_VTCl_gDNA ATGGTAAGAATGAATGACGAATCGGTCAAGCTAATCTGTATATTAATCATTGTACTCATC 
Ler-0_VTCl_GenBank_gDNA ATGGTAAGAATGAATGACGAATCGGTCAAGCTAATCTGTATATTAA^CATTGTACTCATC 

2110 2120 2130 2140 2150 2160 

I I I I I I I I I I I I 

Col - 0_VTCl_TAIR_gDNA 

vtcl - l_VTCl_gDNA 

svt2_VTCl_gDNA AACGTAAAGTCCTATTCGTCTATACATATGTGAACTTATATATGTCTATCAACTAGTTCA 
Ler-0_VTCl_GenBank_gDNA AACGTAAAGTCCTATTCGTCTATACATATGTGAACTTATATATGTCTATCAACTAGTTCA 
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Col-0_VTCl_TAIR_gDNA 

vtcl - l_VTCl_gDNA 

svt2_VTCl_gDNA 

Ler- 0_VTCl_GenBank_gDNA 



Col - 0_VTCl_TAIR_gDNA 

vtcl - l_VTCl_gDNA 

svt2_VTCl_gDNA 

Ler- 0_VTCl_GenBank_gDNA 



Col - 0_VTCl_TAIR_gDNA 

vtcl - l_VTCl_gDNA 

svt2_VTCl_gDNA 

Ler- 0_VTCl_GenBank_gDNA 



Col - 0_VTCl_TAIR_gDNA 

vtcl - l_VTCl_gDNA 

svt2_VTCl_gDNA 

Ler - 0_VTC l_GenBank_gDNA 



Col - 0_VTCl_TAIR_gDNA 

vtcl - l_VTCl_gDNA 

s V 1 2_VTC l_gDNA 

Ler- 0_VTCl_GenBank_gDNA 



Col - 0_VTCl_TAIR_gDNA 

vtcl - l_VTCl_gDNA 

svt2_VTCl_gDNA 

Ler - 0_VTC l_GenBank_gDNA 



Col - 0_VTCl_TAIR_gDNA 

vtcl - l_VTCl_gDNA 

s V 1 2_VTC l_gDNA 

Ler- 0_VTCl_GenBank_gDNA 



Col-0_VTCl_TAIR_gDNA 
vtcl - l_VTCl_gDNA 
svt2_VTCl_gDNA 
Ler-0_VTCl_GenBank_gDNA 



Col - 0_VTCl_TAIR_gDNA 

vtcl - l_VTCl_gDNA 

svt2_VTCl_gDNA 

Ler - 0_VTC l_GenBank_gDNA 



2170 
I I I 



2180 
. . I I 



2190 
. . I I 



2200 
. . I I 



2210 2220 
. . I I I 



ctaccctatatataaagttcatcagagttgtcgatcagcagtgaccactacacattcttS 
ctaccctat aaagttcatcagagttgtcgatcagcagtgaccactacacattcttc 

kkkk ^ 

2230 2240 2250 2260 2270 2280 



I 



I 



I 



I 



I 



I 



I 



I 



I 



I 



I 



I 

ACATAATA 

ACATAATA 

ATACAGCTGAGTTAGGAATGTTAACAAAATAGTTTATGGGAGTATGTTTTATACATAATA 
ATACAGCTGAGTTAGGAATGTTAACAAAATAGTTTATGGGAGTATGTTTTATACATAATA 

2290 2300 2310 2320 2330 2340 

I I I I I I I I I I I I 

CCATCCCTTTAAAAACACAGAATTTTTTTTATCATCTCTGAAACAAATCATTTACAGTAG 
CCATCCCTTTAAAAACACAGAATTTTTTTTATCATCTCTGAAACAAATCATTTACAGTAG 
CCATCCCTTTAAAAACACAGAATTTTCTTTATCATCTCCGAAACAAATCATTTACAGTAG 
CCATCCCTTTAAAAACACAGAATTTTCTTTATCATCTCCGAAACAAATCATTTACAGTAG 

2350 2360 2370 2380 2390 2400 

I I I I I I I I I I I I 

TAAATGT CAACACAACATTAATTCTGTTTGTTGTTGGCATTTACAATTGCAAAATCAT 
TAAATGT CAACACAACATTAATTCTGTTTGTTGTTGGCATTTACAATTGCAAAATCAT 
TAAATGTAAAAACACAACATTAATTCTGTTTGTTGTTGGCATTTACAATTGCAAAATCAT 
TAAATGTAAAAACACAACATTAATTCTGTTTGTTGTTGGCATTTACAATTGCAAAATCAT 



III 



2410 2420 2430 2440 2450 2460 

I I I I I I I I I I I I 

TTTCTCATTTATTATTCGTATTTATTTTGTCAAGAACCCTTGTCTCTAAAATATTCATAG 
TTTCTCATTTATTATTCGTATTTATTTTGTCAAGAACCCTTGTCTCTAAAATATTCATAG 
TTTCTCATTTATTATTCGTATTTATTTTGTCAAGAACCCTTGTCTCTAAAATATTCATAG 
TTTCTCATTTATTATTCGTATTTATTTTGTCAAGAACCCTTGTCTCTAAAATATTCATAG 

2470 2480 2490 2500 2510 2520 

I I I I I I I I I I I I 

AAAAAGAAAAGAGCCATTAATTAATGGCTTGAAGAAAGATTGGTGTATAAGCGTCTACGT 
AAAAAGAAAAGAGCCATTAATTAATGGCTTGAAGAAAGATTGGTGTATAAGCGTCTACGT 
AAAAA AAAAGAGCCATTAATTAATGGCTTGAAGAAAGATTGGTGTATAAGCGTCTACGT 
AAAAA AAAAGAGCCATTAATTAATGGCTTGAAGAAAGATTGGTGTATAAGCGTCTACGT 

2530 2540 2550 2560 2570 |2580 

I I I I I I I I I I I .... I 

GACCTTTAATTAATTTACTTCCCCCAAAAAAGTCAACATTCAACATGTGAATAAAAA TC 
GACCTTTAATTAATTTACTTCCCCCAAAAAAGTCAACATTCAACATGTGAATAAAAA TC 
GACCTTTAATTAATTTACTTCCCCAAAAAAAGTCAACATTCAACATGTGAATAAAAA TC 
GACCTTTAATTAATTTACTTCCCCAAAAAAAGTCAACATTCAACATGTGAATAAAAAATC 

f 

2590 2600 2610 2620 2630 2640 

I I I I I I I I I I I I 

AATATTGGTTTCTAAGTAAGTAAGTACCATATTATTAAATTATTTATTTTGGTAAATACG 
AATATTGGTTTCTAAGTAAGTAAGTACCATATTATTAAATTATTTATTTTGGTAAATACG 
AATATTGGTTTCTAAGTAAGTAAGTACCATATTATTAAATTATTTATTTTGGTAAATACG 
AATATTGGTTTCTAAGTAAGTAAGTACCATATTATTAAATTATTTATTTTGGTAAATACG 



2650 2660 

I I I I I 

CACTCAATTTTTCTCTCAACGGTG 
CACTCAATTTTTCTCTCAACGGTG 



2670 2680 2690 2700 

. . I I I I I I I 

TATATAAACAAAAGGAGTCTCCTTTGGAAAAA 
TATATAAACAAAAGGAGTCTCCTTTGGAAAAA 



CACTCAATTTTTCTCTCAACGGTGGGTGTATATAAACAAAAGGAGTCTCCTTTGGAAAAA 
CACTCAATTTTTCTCTCAACGGTGGGTGTATATAAACAAAAGGAGTCTCCTTTGGAAAAA 



III! 
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Col - 0_VTCl_TAIR_gDNA 

vtcl - l_VTCl_gDNA 

svt2_VTCl_gDNA 

Ler- 0_VTCl_GenBank_gDNA 



Col - 0_VTCl_TAIR_gDNA 

vtcl - l_VTCl_gDNA 

svt2_VTCl_gDNA 

Ler- 0_VTCl_GenBank_gDNA 



2710 2720 2730 2740 2750 | 2760 

I I I I I I I I I I |! . . . I 

ACTTGCCTATCATTTTGCCAACGAACGTTCTTTCTTCTTAATCACAGCTCAGCCTGACGC 
ACTTGCCTATCATTTTGCCAACGAACGTTCTTTCTTCTTAATCACAGCTCAGCCTGACGC 
ACTTGCCTATCATTTTGCCAACGAACGTTCTTTCTTCTTAATCACAGCTCAGCCTGACGC 
ACTTGCCTATCATTTTGCCAACGAACGTTCTTTCTTCTTAATCACAGCTCAGCCT ACGC 

2770 2780 2790 2800 2810 2820 

I I I I I I I I I I I I 

AACCGCTCAGGCTGATCTCTTCCAATTTACAGCCATTTCCCAGCTCAGATCTCTGATCCG 
AACCGCTCAGGCTGATCTCTTCCAATTTACAGCCATTTCCCAGCTCAGATCTCTGATCCG 
AACCGCTCAGGCTGATCTCTTCCAATTTACAGCCATTTCCCAGCTCAGATCTCTGATCCG 
AACCGCTCAGGCTGATCTCTTCCAATTTACAGCCATTTCCCAGCTCAGATCTCTGATCCG 



Col - 0_VTCl_TAIR_gDNA 

vtcl - l_VTCl_gDNA 

svt2_VTCl__gDNA 

Ler- 0_VTCl_GenBank_gDNA 



Col - 0_VTCl_TAIR_gDNA 

vtcl - l_VTCl_gDNA 

svt2_VTCl_gDNA 

Ler- 0_VTCl_GenBank_gDNA 



2830 2840 2850 2860 2870 2880 

I I I I I I I I I I I I 

GTGAGATCTCTCTCAAGGTAATGCCCCTGCAATTTTGCTTACTTCTCTGGTTGTGATATG 
GTGAGATCTCTCTCAAGGTAATGCCCCTGCAATTTTGCTTACTTCTCTGGTTGTGATATG 
GTGAGATCTCTCTCAAGGTAATGCCCCTGCAATTTTGCTTACTTCTCTGGTTGTGATATG 
GTGAGATCTCTCTCAAGGTAATGCCCCTGCAATTTTGCTTACTTCTCTGGTTGTGATATG 



2890 2900 2910 2920 2930 2940 

I I I I I I I I I I .Y . . I I 

CATGTTCTTCGAATTTTCATCGTTTGTGATTTGAATTCTCATTTTGTATTTGCTGTTGTT 
CATGTTCTTCGAATTTTCATCGTTTGTGATTTGAATTCTCATTTTGTATTTGCTGTTGTT 
CATGTTCTTCGAATTTTCATCGTTTGTGATTTGAATTCTCATTTTGTATTTGCTGTTGTT 
CATGTTCTTCGAATTTTCATCGTTTGTGATTTGAATTCTCATTTTGTATTT CTGTTGTT 



Col - 0_VTC l_TAIR_gDNA 

vtcl - l_VTCl_gDNA 

svt2_VTCl_gDNA 

Ler- 0_VTCl_GenBank_gDNA 



Col - 0_VTCl_TAIR_gDNA 

vtcl - l_VTCl_gDNA 

svt2_VTCl_gDNA 

Ler- 0_VTCl_GenBank_gDNA 



Col - 0_VTCl_TAIR_gDNA 

vtcl - l_VTCl_gDNA 

svt2_VTCl_gDNA 

Ler- 0_VTCl_GenBank_gDNA 



Col - 0_VTCl_TAIR_gDNA 

vtcl - l_VTCl_gDNA 

svt2_VTCl_gDNA 

Ler- 0_VTCl_GenBank_gDNA 



2950 2960 2970 2980 f 2990 3000 

I I I I I I I I I I I I 

GGTTTTTAATTCGATTTTCCGGAACAGATTATGGGATTT GTATTCGAATCTTCGATTTG 
GGTTTTTAATTCGATTTTCCGGAACAGATTATGGGATTT GTATTCGAATCTTCGATTTG 
GGTTTTTAATTCGATTTTCCGGAACAGGTTATGGGGATTTGTATTCGAATCTTCGATTTG 
GGTTTTTAATTCGATTTTCCGGAACAGGTTATGGGGATTTGTAT CGAATCTTCGATTTG 

I HI 

3010 3020 3030 3040 3050 3060 

I I I I I I I I I I I I 

ATGACATAATGTCCCAGCCTTTTATGTTTAATCTTGAAATGATGGACTTTTATCCGATCT 
ATGACATAATGTCCCAGCCTTTTATGTTTAATCTTGAAATGATGGACTTTTATCCGATCT 
ATGACATAATGTCCCAGCCATTTATGTTTAATCTTGAAATGATGGACTTTAATCCGATCT 
ATGACATAATGTCCCAGCCATTTATGTTTAATCTTGAAATGATGGACTTTAATCCGATCT 

3070 3080 3090 3100 3110 3120 

I I I I I I I I I I I I 

GGGTTTAAAGCTGGAATTTTGATTGTGGGTACTATTAGGTTTCATTGATTTATTGCTTGG 
GGGTTTAAAGCTGGAATTTTGATTGTGGGTACTATTAGGTTTCATTGATTTATTGCTTGG 
GGGTTTAAAGCTGGAATTTTGATTGTGGGTACTATTAGGTTTCATTGATTTATTGCTTGG 
GGGTTTAAAGCTGGAATTTTGATTGTGGGTACTATTAGGTTTCATTGATTTATTGCTTGG 

3130 3140 3150 3160 3170 3180 

I I I I I I I I I I I I 

TCCAACATTTTTAGCAGCTGGTATTGAGCTCTTGTTGTCTGAATTTTGGAAAGAACTATT 
TCCAACATTTTTAGCAGCTGGTATTGAGCTCTTGTTGTCTGAATTTTGGAAAGAACTATT 
TCCAACATTTTTAGCAGCTGGTATTGAGCTCTTGTTGTCTGAATTTTGGAAAGAACTATT 
TCCAACATTTTTAGCAGCTGGTATTGAGCTCTTGTTGTCTGAATTTTGGAAAGAACTATT 



Col - 0_VTCl_TAIR_gDNA 

vtcl - l_VTCl_gDNA 

svt2_VTCl_gDNA 

Ler- 0_VTCl_GenBank_gDNA 



3190 3200 3210 3220 3230 3240 

I I I I I I I I I I I I 

TTTGTTGTATCGTTTTGATTTATTTGGATCTGAATTCATTCACCTTTTTCTCTGATTATT 
TTTGTTGTATCGTTTTGATTTATTTGGATCTGAATTCATTCACCTTTTTCTCTGATTATT 
TTTGTTGTATCGTTTTGATTTATTTGGATCTGAATTCATTCACCTTTTTCTCTGATTATT 
TTTGTTGTATCGTTTTGATTTATTTGGATCTGAATTCATTCACCTTTTTCTCTGATTATT 
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Col - 0_VTCl_TAIR_gDNA 

vtcl - l_VTCl_gDNA 

svt2_VTCl_gDNA 

Ler- 0_VTCl_GenBank_gDNA 



3250 



3260 



3270 



3280 



3290 



3300 



I I I I I I I I I I I I 

GTTTTGTGTCGGTTGCATCCACTTTGATTAGATCTGAATGAATCATTTTTTTATGTGCTC 
GTTTTGTGTCGGTTGCATCCACTTTGATTAGATCTGAATGAATCATTTTTTTATGTGCTC 
GTTTTGTGTCGGTTGCATCCACTTTGATTAGATCTGAATGAATCATTTTTTTATGTGCTC 
GTTTTGTGTCGGTTGCATCCACTTTGATTAGATCTGAATGAATCATTTTTTTATGTGCTC 



Col - 0_VTCl_TAIR_gDNA 

vtcl - l_VTCl_gDNA 

svt2_VTCl_gDNA 

Ler- 0_VTCl_GenBank_gDNA 



3310 3320 3330 3340 3350 3360 

I I I I I I I I I I I I 

AAGTTATTGTATGGATTGTTCTGTTTCTAGCATGTTTTGGTTAGACATTGTTAAGATCTG 
AAGTTATTGTATGGATTGTTCTGTTTCTAGCATGTTTTGGTTAGACATTGTTAAGATCTG 
AAGTTATTGTATGGATTGTTCTGTTTCTAGCATGTTTTGGTTAGACATTGTTAAGATCTG 
AAGTTATTGTATGGATTGTTCTGTTTCTAGCATGTTTTGGTTAGACATTGTTAAGATCTG 



3370 3380 | 3390 3400 3410 3420 

— I — I — I — I — r. . . . I I I I I I I 

ACGTTTGCATTTTCAGGAAAAGGAG TTAGAGCATCATCAAGATGAAGGCACTCATTCTT 
ACGTTTGCATTTTCAGGAAAAGGAG TTAGAGCATCATCAAGATGAAGGCACTCATTCTT 
ACGTTTGCATTTTCAGGAAAAGGAG TTAGAGCATCATCAAGATGAAGGCACTCATTCTT 



Col - 0_VTCl_TAIR_gDNA 
vtcl - l_VTCl_gDNA 
svt2_VTCl_gDNA 

Ler-0_VTCl_GenBank_gDNA ACGTTTGCATTTTCAGGAAAAGGAGCTTAGAGCATCATCAAGATGAAGGCACTCATTCTT 



3430 3440 3450 3460 3470 3480 

I I I I I I I I I I I I 

GTTGGAGGCTTCGGCACTCGCTTGAGACCATTGACTCTCAGTTTCiCAAAGCCCCTTGTT 
GTTGGAGGCTTCGGCACTCGCTTGAGACCATTGACTCTCAGTTTclcAAAGCCCCTTGTT 
GTTGGAGGCTTCGGCACTCGCTTGAGACCATTGACTCTCAGTTTclcAAAGCCCCTTGTT 



Col - 0_VTCl_TAIR_gDNA 
vtcl - l_VTCl_gDNA 
svt2_VTCl_gDNA 

Ler-0_VTCl_GenBank_gDNA GTTGGAGGCTTCGGCACTCGCTTGAGACCATTGACTCTCAGTTTC|CAAAGCCCCTTGTT 



Col - 0_VTCl_TAIR_gDNA 

vtcl - l_VTCl_gDNA 

svt2_VTCl_gDNA 

Ler- 0_VTCl_GenBank_gDNA 



3490 3500 3510 3520 3530 3540 

I I I I I I I I I I I I 

GATTTTGCTAATAAACCCATGATCCTTCATCAGGTAATCTATCTTAAATTTGCCGCTTTA 
GATTTTGCTAATAAACCCATGATCCTTCATCAGGTAATCTATCTTAAATTTGCCGCTTTA 
GATTTTGCTAATAAACCCATGATCCTTCATCAGGTAATCTATCTTAAATTTGCCGCTTTA 
GATTTTGCTAATAAACCCATGATCCTTCATCAGGTAATCTATCTTAAATTTGCCGCTTTA 



Col - 0_VTCl_TAIR_gDNA 

vtcl - l_VTCl_gDNA 

svt2_VTCl_gDNA 

Ler- 0_VTCl_GenBank_gDNA 



3550 3560 3570 3580 3590 3600 

I I I I I I I I I I I I 

GTCTGCCAGTTCTTACCTATGCCTATGTTTGAACCGAGGCATGTTTTCTTGTAGATAGAG 
GTCTGCCAGTTCTTACCTATGCCTATGTTTGAACCGAGGCATGTTTTCTTGTAGATAGAG 
GTCTGCCAGTTCTTACCTATGCCTATGTTTGAACCGAGGCATGTTTTCTTGTAGATAGAG 
GTCTGCCAGTTCTTACCTATGCCTATGTTTGAACCGAGGCATGTTTTCTTGTAGATAGAG 



Col - 0_VTCl_TAIR_gDNA 

vtcl - l_VTCl_gDNA 

svt2_VTCl_gDNA 

Ler- 0_VTCl_GenBank_gDNA 



3610 3620 3630 3640 3650 3660 

I I I I I I I I I I I I 

GCTCTTAAGGCAGTTGGAGTTGATGAAGTGGTTTTGGCCATCAATTATCAGCCAGAGGTA 
GCTCTTAAGGCAGTTGGAGTTGATGAAGTGGTTTTGGCCATCAATTATCAGCCAGAGGTA 
GCTCTTAAGGCAGTTGGAGTTGATGAAGTGGTTTTGGCCATCAATTATCAGCCAGAGGTA 
GCTCTTAAGGCAGTTGGAGTTGATGAAGTGGTTTTGGCCATCAATTATCAGCCAGAGGTA 



Col - 0_VTCl_TAIR_gDNA 

vtcl - l_VTCl_gDNA 

svt2_VTCl_gDNA 

Ler - 0_VTC l_GenBank_gDNA 



3670 3680 3690 3700 3710 3720 

I I I I I I I I I I I I 

AGATACTAATCTCTCTTAACTTTTTTTTTTGCAGCTATTTTCTGTTTACATATGTTTGTA 
AGATACTAATCTCTCTTAACTTTTTTTTTTGCAGCTATTTTCTGTTTACATATGTTTGTA 
AGATACTAATCTCTCTTAACTTTTTTTTTTGCAGCTATTTTCTGTTTACATATGTTTGTA 
AGATACTAATCTCTCTTAACTTTTTTTTTTGCAGCTATTTTCTGTTTACATATGTTTGTA 



Col - 0_VTCl_TAIR_gDNA 

vtcl - l_VTCl_gDNA 

svt2_VTCl_gDNA 

Ler - 0_VTC l_GenBank_gDNA 



3730 3740 3750 3760 3770 3780 

I I I I I I I I I I I I 

TTTACCATTTGCTCTGTTTCGACAGGTGATGCTGAACTTCTTGAAGGACTTTGAGACCAA 
TTTACCATTTGCTCTGTTTCGACAGGTGATGCTGAACTTCTTGAAGGACTTTGAGACCAA 
TTTACCATTTGCTCTGTTTCGACAGGTGATGCTGAACTTCTTGAAGGACTTTGAAACCAA 
TTTACCATTTGCTCTGTTTCGACAGGTGATGCTGAACTTCTTGAAGGACTTTGAGACCAA 
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3790 3800 3810 3820 3830 3840 

I I I I I I I I I I I I 

Col-0_VTCl_TAIR_gDNA GCTGGAAATCAAAATCACTTGCTCACAAGAGACCGAGCCACTAGGTACCGCTGGTCCTCT 
vtc 1 - 1_VTC l_gDNA GCTGGAAATCAAAATCACTTGCTCACAAGAGACCGAGCCACTAGGTACCGCTGGTCCTCT 
svt2_VTCl_gDNA GCTGGAAATCAAAATCACTTGCTCACAAAAAACCGAGCCACTAGGTACCGCTGGTCCTCT 
Ler-0 VTCl GenBank gDNA GCTGGAAATCAAAATCACTTGCTCACAAGAGACCGAGCCACTAGGTACCGCTGGTCCTCT 

- - - A A 

3850 H^^^^^H^^^^^^^^^|^^^8 0 3890 3900 

I I I I I I I I I I I I 

Col - 0_VTC l_TAIR_gDNA GGCTCTAGCGAGAG ACAAATTGCTTGATGGATCT 

vtcl-l_VTCl_gDNA GGCTCTAGCGAGAG ACAAATTGCTTGATGGATCT 

svt2_VTCl_gDNA GGCTCTAGCGAGAG ACAAATTGCTTGATGGATCT 

Ler-0 VTCl GenBank gDNA GGCTCTAGCGAGAGGTTGGCGTAATCATGGTCATAGCTAGACAAATTGCTTGATGGATCT 

" " " I 

3910 3920 3930 3940 3950 3960 

I I I I I I I I I I I I 

Col - 0_VTCl_TAIR_gDNA GGAGAGCCCTTCTTTGTTCTTAACAGTGATGTGATTAGTGAGTACCCTCTTAAAGAAATG 
vtcl - l_VTCl_gDNA GGAGAGCCCTTCTTTGTTCTTAACAGTGATGTGATTAGTGAGTACCCTCTTAAAGAAATG 
svt2_VTCl_gDNA GGAAAGCCCTTCTTTGTTCTTAACAGTGATGTGATTAGTGAGTACCCTCTTAAAGAAATG 
Ler-0_VTCl_GenBank_gDNA GGAGAGCCCTTCTTTGTTCTTAACAGTGATGTGATTAGTGAGTACCCTCTTAAAGAAATC 

3970 3980 3990 4000 4010 | 4020 

I I I I I I I I I I I I 

Col-0_VTCl_TAIR_gDNA CTTGAGTTTCACAAATCTCACGGTGGGGAAGCCTCCATAATGGTAACAAAGGTGAGATTA 
vtcl - l_VTCl_gDNA CTTGAGTTTCACAAATCTCACGGTGGGGAAGCCTCCATAATGGTAACAAAGGTGAGATTA 
svt2_VTCl_gDNA CTTGAGTTTCACAAATCTCACGGTGGGGAAGCCTCCATAATGGTAACAAAGGTGAGATTA 
Ler- 0_VTCl_GenBank_gDNA CTTGAGTTTCACAAATCTCACGGTGGGGAAGCCTCCATAATGGTAACAAAGGTG GATTA 

4030 4040 4050 4060 4oto 4080 

I I I I I I I I I I I I 

Col-0_VTCl_TAIR_gDNA TCGAAACATAATACTCTCCAGTTACGAGATAAGTACGTTATTCATCTAATGTGGACTTGC 
vtc 1 - 1_VTC l_gDNA TCGAAACATAATACTCTCCAGTTACGAGATAAGTACGTTATTCATCTAATGTGGACTTGC 
svt2_VTCl_gDNA TCGAAACATAATACTCTCCAGTTACGAGATAAGTACGTTATTCATCTAATGTGGACTTGC 
Ler - 0_VTC l_GenBank_gDNA TCGAAACATAATACTCTCCAGTTACGAGATAAGTACGTTATTCATCTAATCTGGACTTGC 



Col - 0_VTCl_TAIR_gDNA 
vtcl - l_VTCl_gDNA 
s vt 2_VTC l_gDNA 
Ler-0_VTCl_GenBank_gDNA 



4090 4100 4110 4120 4130 4140 

I I I I I I I I I I I I 

ATGTATTGGTTATATAGGTGGATGAACCGTCGAAATATGGAGTGGTTGTTATGGAAGAAA 
ATGTATTGGTTATATAGGTGGATGAACCGTCGAAATATGGAGTGGTTGTTATGGAAGAAA 
ATGTATTGGTTATATAGGTGGATGAACCGTCGAAATATGGAGTGGTTGTTATGGAAGAAA 
ATGTATTGGTTATATAGGTGGATGAACCGTCGAAATATGGAGTGGTTGTTATGGAAGAAA 



Col - 0_VTC l_TAIR_gDNA 
vtc 1 - 1_VTC l_gDNA 
s V 1 2_VTC l_gDNA 
Ler-0_VTCl_GenBank_gDNA 



4150 4160 4170 4180 4190 4200 

I I I I I I I I I I I I 

GCACTGGAAGAGTGGAGAAGTTTGTGGAAAAGCCAAAACTGTATGTAGGTAACAAGATCA 
GCACTGGAAGAGTGGAGAAGTTTGTGGAAAAGCCAAAACTGTATGTAGGTAACAAGATCA 
GCACTGGAAGAGTGGAGAAGTTTGTGGAAAAGCCAAAACTGTATGTAGGTAACAAGATCA 
GCACTGGAAGAGTGGAGAAGTTTGTGGAAAAGCCAAAACTGTATGTAGGTAACAAGATCA 



Col-0_VTCl_TAIR_gDNA 

vtc 1 - 1_VTC l_gDNA 

svt2_VTCl_gDNA 

Ler - 0_VTC l_GenBank_gDNA 



4210 4220 4230 4240 4250 4260 

I I I I I I I I I I I I 

ACGCTGGGATTTATCTTCTGAACCCATCTGTTCTTGATAAGATTGAGCTAAGACCGACTT 
ACGCTGGGATTTATCTTCTGAACCCATCTGTTCTTGATAAGATTGAGCTAAGACCGACTT 
ACGCTGGGATTTATCTTCTGAACCCATCTGTTCTTGATAAGATTGAGCTAAGACCGACTT 
ACGCTGGGATTTATCTTCTGAACCCATCTGTTCTTGATAAGATTGAGCTAAGACCGACTT 



Col - 0_VTC l_TAIR_gDNA 
vtc 1 - 1_VTC l_gDNA 
s V 1 2_VTC l_gDNA 
Ler-0_VTCl_GenBank_gDNA 



4270 4280 4290 4300 i 4310 4320 

I I I I I I I I IT. . . I I I 

CAATCGAAAAAGAGACTTTCCCTAAGATTGCAGCAGCGCAAGGGCTCTATGCTATGGTGC 
CAATCGAAAAAGAGACTTTCCCTAAGATTGCAGCAGCGCAAGGGCTCTATGCTATGGTGC 
CAATCGAAAAAGAGACTTTCCCTAAGATTGCAGCAGCGCAAGGGCTCTATGCTATGGTGC 
CAATCGAAAAAGAGACTTTCCCTAAGATTGCAGCAGCGCAAGGGCACTATGCTATGGTGC 



Page 19 of 28 



FIOOOResearch 2013, 2:6 Last updated: 23 JAN 2014 



Col - 0_VTC l_TAIR_gDNA 

vtc 1 - 1_VTC l_gDNA 

svt2_VTCl_gDNA 

Ler - 0_VTC l_GenBank_gDNA 



4330 



4340 



4350 



4360 



4370 



4380 



I I I I I I I I I I I I 

TACCAGGGTTTTGGATGGACATTGGGCAACCCCGTGACTACATAACGGGTTTGAGACTCT 
TACCAGGGTTTTGGATGGACATTGGGCAACCCCGTGACTACATAACGGGTTTGAGACTCT 
TACCAGGGTTTTGGATGGACATTGGGCAACCCCGTGACTACATAACGGGTTTGAGACTCT 
TACCAGGGTTTTGGATGGACATTGGGCAACCCCGTGACTACATAACGGGTTTGAGACTCT 



Col - 0_VTC l_TAIR_gDNA 

vtc 1 - 1_VTC l_gDNA 

svt2_VTCl_gDNA 

Ler - 0_VTC l_GenBank_gDNA 



4390 4400 4410 4420 4430 4440 

I I I I I I I I I I I I 

ACTTAGACTCCCTTAGGAAGAAATCTCCTGCCAAATTAACCAGTGGGCCACACATAGTTG 
ACTTAGACTCCCTTAGGAAGAAATCTCCTGCCAAATTAACCAGTGGGCCACACATAGTTG 
ACTTAGACTCCCTTAGGAAGAAATCTCCTGCCAAATTAACCAGTGGGCCACACATAGTTG 
ACTTAGACTCCCTTAGGAAGAAATCTCCTGCCAAATTAACCAGTGGGCCACACATAGTTG 



Col - 0_VTC l_TAIR_gDNA 

vtc 1 - 1_VTC l_gDNA 

svt2_VTCl_gDNA 

Ler - 0_VTC l_GenBank_gDNA 



4450 4460 4470 4480 4490 4500 

I I I I I I I I I I I I 

GGAATGTTCTTGTTGACGAAACCGCTACAATTGGGGAAGGATGTTTGATTGGACCAGACG 
GGAATGTTCTTGTTGACGAAACCGCTACAATTGGGGAAGGATGTTTGATTGGACCAGACG 
GGAATGTTCTTGTTGACGAAACCGCTACAATTGGGGAAGGATGTTTGATTGGACCAGACG 
GGAATGTTCTTGTTGACGAAACCGCTACAATTGGGGAAGGATGTTTGATTGGACCAGACG 



Col - 0_VTC l_TAIR_gDNA 

vtc 1 - 1_VTC l_gDNA 

svt2_VTCl_gDNA 

Ler - 0_VTC l_GenBank__gDNA 



4510 4520 4530 4540 4550 4560 

I I I I I I I I I I I I 

TTGCCATTGGTCCAGGCTGCATTGTTGAGTCAGGAGTCAGACTCTCCCGATGCACGGTCA 
TTGCCATTGGTCCAGGCTGCATTGTTGAGTCAGGAGTCAGACTCTCCCGATGCACGGTCA 
TTGCCATTGGTCCAGGCTGCATTGTTGAGTCAGGAGTCAGACTCTCCCGATGCACGGTCA 
TTGCCATTGGTCCAGGCTGCATTGTTGAGTCAGGAGTCAGACTCTCCCGATGCACGGTCA 



Col - 0_VTCl_TAIR_gDNA 

vtc 1 - 1_VTC l_gDNA 

svt2_VTCl_gDNA 

Ler- 0_VTC l_GenBank_gDNA 



4570 4580 4590 4600 4610 4620 

I I I I I I I I I I I I 

TGCGTGGAGTCCGCATCAAGAAGCATGCGTGTATCTCGAGCAGTATCATCGGGTGGCACT 
TGCGTGGAGTCCGCATCAAGAAGCATGCGTGTATCTCGAGCAGTATCATCGGGTGGCACT 
TGCGTGGAGTCCGCATCAAGAAGCATGCGTGTATCTCGAGCAGTATCATCGGGTGGCACT 
TGCGTGGAGTCCGCATCAAGAAGCATGCGTGTATCTCGAGCAGTATCATCGGGTGGCACT 



Col - 0_VTCl_TAIR_gDNA 

vtc 1 - 1_VTC l_gDNA 

svt2_VTCl_gDNA 

Ler- 0_VTCl_GenBank_gDNA 



4630 4640 4650 4660 4670 468 

I I I I I I I I I I I I 

CAACGGTTGGTCAATGGGCCAGGATCGAGAACATGACGATCCTCGGTGAGGATGTTCATG 
CAACGGTTGGTCAATGGGCCAGGATCGAGAACATGACGATCCTCGGTGAGGATGTTCATG 
CAACGGTTGGTCAATGGGCCAGGATCGAGAACATGACGATCCTCGGTGAGGATGTTCATG 
CAACGGTTGGTCAATGGGCCAGGATCGAGAACATGACGATCCTCGGTGAGGATGTTCATG 



Col - 0_VTC l_TAIR_gDNA 

vtc 1 - 1_VTC l_gDNA 

svt2_VTCl_gDNA 

Ler- 0_VTC l_GenBank_gDNA 



Col - 0_VTC l_TAIR_gDNA 

vtc 1 - 1_VTC l_gDNA 

svt2_VTCl_gDNA 

Ler - 0_VTC l_GenBank_gDNA 



4690 4700 4710 4720 4730 4740 

I I I I I I I I I I I I 

TGAGCGATGAGATCTATAGCAATGGAGGAGTTGTTTTGCCACACAAGGAGATCAAATCAA 
TGAGCGATGAGATCTATAGCAATGGAGGAGTTGTTTTGCCACACAAGGAGATCAAATCAA 
TGAGCGATGAGATCTATAGCAATGGAGGAGTTGTTTTGCCACACAAGGAGATCAAATCAA 
TGAGCGATGAGATCTATAGCAATGGAGGAGTTGTTTTGCCACACAAGGAGATCAAATCAA 

4750 4760 4770 4780 || 4790 4800 

I I I I I I I I I I I I 

ACATCTTGAAGCCAGAGATAGTGATGTGAAAATGAGATATTATA TGTGCAACTTTTTT 
ACATCTTGAAGCCAGAGATAGTGATGTGAAAATGAGATATTATA TGTGCAACTTTTTT 
ACATCTTGAAGCCAGAGATAGTGATGTGAAAATGAGATATTATA TGTGCAACTTTTTT 
ACATCTTGAAGCCAGAGATAGTGATGTGAAAATGAGATATTATATATGTGCAACTTTTTT 



Col - 0_VTC l_TAIR_gDNA 
vtc 1 - 1_VTC l_gDNA 
svt2_VTCl_gDNA 



4810 4820 4830 4840 4850 4860 

I I I I I I I I I I I I 

TTTTTTTTTT GTGTCCTTTCTTCAACTTTGAAATCGCTTTCGTAATTCTTAATGGCTTT 
TTTTTTTTTT GTGTCCTTTCTTCAACTTTGAAATCGCTTTCGTAATTCTTAATGGCTTT 



Ler-0_VTCl_GenBank_gDNA TTTTTTTTTTTGTGTCCTTTCTTCAACTTTGAAATCGCTTTCGTAATTCTTAATGGCTTT 



Figure S1. Sequence alignment of the VTC1 gene sequence of the Col-0 TAIR database, the vtc1-1, svt2 mutants, and the Ler-0 
Gen Bank database. Horizontal arrows denote 5' respectively 3' flanking regions of the sequence insertion, which is highlighted in grey, in 
the VTC1 promoter region (between base pairs 1990 and 2273). Upright arrows indicate sequences shared between svt2 and Ler Arrows 
pointing down denote sequences shared between svt2 and Col. Arrowheads point to sequences unique to svt2. Highlighted in yellow are the 
start and stop codons, respectively Highlighted in green is the vtc1-1 mutation. 
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1760 1770 1780 1790 1800 

I I I I I I I I I I 

Col - 0_VTCl_gDNA_TAIR GATTTACTTACATTTAACTAATTTTTAAGGTCTGTTGATCGAACTCGTTA 

Ler - 0_VTC l_gDNA_Genbank ^^^^^^^ 

svt2 Kl Col R M3_G1F 

svt2 Kl Col R svt2 R M4_G1F 

1810 1820 1830 1840 1850 

I I I I I I I I I I 

Col - 0_VTCl_gDNA_TAIR CCTTGAACAATTAGTAGGACAACTCAAACCACTAAACCATTATACTTTAA 
Ler - 0_VTC l_gDNA_Genbank 

svt2 Kl Col R M3_G1F 

svt2 Kl Col R svt2 R M4 GIF 



1860 1870 1880 1890 1900 

I I I I I I I I I I 

Col - 0_VTCl_gDNA_TAIR GGATTTATGTAAAATTTCAAATATATATAGTTTAGTAGATGCACTTATCA 
Ler-0_VTCl_gDNA_Genbank 

svt2 Kl Col R M3_G1F 

svt2 Kl Col R svt2 R M4_G1F 

1910 1920 1930 1940 1950 

I I I I I I I I I I 

Col - 0_VTCl_gDNA_TAIR TCACACTCACCAATTGGATGTCAACACCTGGTTCTAGCTTTTTAATTACC 
Ler- 0_VTCl_gDNA_Genbank 

svt2 Kl Col R M3_G1F 

svt2 Kl Col R svt2 R M4_G1F 

1960 1970 1980 1990 2000 

I I I I I I I I I I 

Col - 0_VTCl_gDNA_TAIR AAAGTGAAAAAACTGACTTTTTCTAAAAAAAATTCGTTCTAGATGGATGC 
Ler- 0_VTCl_gDNA_Genbank TCGNATCTAGATGGATGC 

svt2 Kl Col R M3_G1F 

svt2 Kl Col R svt2 R M4 GIF 



2010 2020 2030 2040 2050 

I I I I I I I I I I 

Col-0_VTCl_gDNA_TAIR TCTTCAAATTCGTTCTAAAAAAAACTGA 

Ler- 0_VTCl_gDNA_Genbank TCTTTAAAT CTTCGATATTTTTTATCCGTTTCGATAATATGGTAAGAAT 

svt2 Kl Col R M3_G1F ATAGTCGCTCTAGAATGTGA 

svt2 Kl Col R svt2 R M4_G1F ATCAGCTTCTAGCATGGATGCTCCGTTTCGATAATAATGTGAGAAT 

2060 2070 2080 2090 2100 
I I I I I I I I I I 

Col - 0_VTCl_gDNA_TAIR 

Ler- 0_VTCl_gDNA_Genbank GAATGACGAATCGGTCAAGCTAATCTGTATATTAA CATTGTACTCATCA 

svt2 Kl Col R M3_G1F 

svt2 Kl Col R svt2 R M4_G1F GAATGACGAATCGGTCAAGCTAATCTGTATATTAATCATTGTACTCATCA 

2110 2120 2130 2140 2150 

I I I I I I I I I I 

Col-0_VTCl_gDNAjrAIR .^.^ 

Ler-0_VTCl_gDNA_Genbank ACGTAAAGTCCTATTCGTCTATACATATGTGAACTTATATATGTCTATCA 
svt2 Kl Col R M3_G1F 

svt2 Kl Col R svt2 R M4_G1F ACGTAAAGTCCTATTCGTCTATACATATGTGAACTTATATATGTCTATCA 

2160 2170 2180 2190 2200 

I I I I I I I I I I 

Col - 0_VTCl_gDNAjrAIR ^ ^ ^ ^ ^ ^ ^ ^, ... ... 

Ler- 0_VTCl_gDNA_Genbank ACTAGTT CACTACCCTATAAAGTTCATCAGAGTTGTCGATCAGCAGTG 

svt2 Kl Col R M3_G1F 

svt2 Kl Col R svt2 R M4 GIF ACTAGTTCACACTACCCTATAAAGTTCATCAGAGTTGTCGATCAGCAGTG 
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Col - 0_VTCl_gDNA_TAIR 

Ler- 0_VTCl_gDNA_Genbank 

svt2 Kl Col R M3_G1F 

svt2 Kl Col R svt2 R M4 GIF 



Col - 0_VTCl_gDNA_TAIR 
Ler- 0_VTCl_gDNA_Genbank 



svt2 Kl Col 
svt2 Kl Col 



R M3_G1F 

R svt2 R M4 GIF 



2210 
I I I 



2220 
. . I I 



2230 
. . I I 



2240 2250 
. . I I I 



ACCACTACACATTCTTCATACAGCTGAGTTAGGAATGTTAACAAAATAGT 

ACCACTACACATTCTTCATACAGCTGAGTTAGGAATGTTAACAAAATAGT 

> 

2260 2270 2280 2290 2300 

I I I I I I I I I I 

ACATAATACCATCCCTTTAAAAACACAGAAT 
TTATGGGAGTATGTTTTATACATAATACCATCCCTTTAAAAACACAGAAT 
CATAATACCATCCCTTTAAAA CACAGAAT 
TTATGGGAGTATGTTTTATACATAATACCATCCCTTTAAAAACACAGAAT 



Col - 0_VTCl_gDNA_TAIR 
Ler- 0_VTCl_gDNA_Genbank 
svt2 Kl Col R M3 GIF 



2310 2320 2330 2340 2350 

I I I I I I I I I I 

TTTTTTTATCATCTCTGAAACAAATCATTTACAGTAGTAAATGTCAA C 
TTTCTTTATCATCTCCGAAACAAATCATTTACAGTAGTAAATGTAAAAAC 
TTTTTTTATCATCTCTGAAACAAATCATTTACAGTAGTAAATGTCAA C 



svt2 Kl Col R svt2 R M4 GIF TTTCTTTATCATCTCCGAAACAAATCATTTACAGTAGTAAATGTAAAAAC 



Col - 0_VTCl_gDNA_TAIR 
Ler- 0_VTCl_gDNA_Genbank 
svt2 Kl Col R MS GIF 



2360 2370 2380 2390 2400 

I I I I I I I I I I 

ACAACATTAATTCTGTTT GTTGTTGGCATTTACAATTGCAAAATCATTT 
ACAACATTAATTCTGTTT GTTGTTGGCATTTACAATTGCAAAATCATTT 
ACAACATTAATTTGGTTTTGTTGTTGGCATTTACAATTGCAAAATCATTT 



svt2 Kl Col R svt2 R M4 GIF ACAACATTAATTCTGTTT GTTGTTGGCATTTACAATTGCAAAATCATTT 



Col - 0_VTCl_gDNA_TAIR 

Ler- 0_VTCl_gDNA_Genbank 

svt2 Kl Col R M3_G1F 

svt2 Kl Col R svt2 R M4 GIF 



2410 2420 2430 2440 2450 

I I I I I I I I I I 

TCTCATTTATTATTCGTATTTATTTTGTCAAGAACCCTTGTCTCTAAAAT 
TCTCATTTATTATTCGTATTTATTTTGTCAAGAACCCTTGTCTCTAAAAT 
TCTCATTTATTATTCGTATTTATTTTGTCAAGAACCCTTGTCTCTAAAAT 
TCTCATTTATTATTCGTATTTATTTTGTCAAGAACCCTTGTCTCTAAAAT 



Col - 0_VTCl_gDNA_TAIR 
Ler- 0_VTCl_gDNA_Genbank 
svt2 Kl Col R M3 GIF 



2460 2470 2480 2490 2500 

I I I I I I I I I I 

ATTCATAGAAAAAGAAAAGAGCCATTAATTAATGGCTTGAAGAAAGA TT 
ATTCATAGAAAAA AAAAGAGCCATTAATTAATGGCTTGAAGAAAGA TT 
ATTCATAGAAAAAGAAAAGAGCCATTAATTAATGGCTTGAAGAAAGA TT 



svt2 Kl Col R svt2 R M4 GIF ATTCATAGAAAAA AAAAGAGCCATTAATTAATGGCTTGAAGAAAGAATT 



Col - 0_VTCl_gDNA_TAIR 
Ler- 0_VTCl_gDNA_Genbank 
svt2 Kl Col R MS GIF 



2510 2520 2530 2540 2550 

I I I I I I I I I I 

GGTGTATAAGC GTCTACGTGACCTTTAA TTAATTTACTT CCCCC 
GGTGTATAAGC GTCTACGTGACCTTTAA TTAATTTACTT CCCCA 
GGTGTATAAGC GTCTACGTGACCTTTAA TTAATTTACTT CCCCC 



svt2 Kl Col R svt2 R M4 GIF GGTGTATAAACCGTCTACCTGACCTTTAAATTTAATTTACTTTCCCCCCA 



Col - 0_VTCl_gDNA_TAIR 
Ler- 0_VTCl_gDNA_Genbank 
svt2 Kl Col R M3 GIF 



2560 2570 2580 2590 2600 

I I I I I I I I I I 

AAAAAAGTCAAC ATTCAA CATGTG AATAAAAA TCAATATTGG TTT 
AAAAAAGTCAAC ATTCAA CATGTG AATAAAAAATCAATATTGG TTT 
AAAAAAGTCAAC ATTCAA CATGTG AATAAAAA TCAATATTGG TTT 



svt2 Kl Col R svt2 R M4 GIF CAAAAAGTCAACCATTCAAACATGGGGAGTAAAAAATCAATATTGGGTTG 



Col - 0_VTCl_gDNA_TAIR 
Ler- 0_VTCl_gDNA_Genbank 
svt2 Kl Col R M3 GIF 



2610 2620 2630 2640 2650 

I I I I I I I I I I 

CTAAGTAA GTAAGTACCATATTATTAAATTATTTATTTT GGTAAATAC 
CTAAGTAA GTAAGTACCATATTATTAAATTATTT ATTTTGGTAAATAC 
CTAAGTAA GTAAGTACCATATTATTAAATTATTTATTTT GGTAAATAC 



svt2 Kl Col R svt2 R M4 GIF CTAAGTAAAGTAAGTACCATATTATTAAATTATTTTATTTTGGTAAAAGA 
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Col - 0_VTCl_gDNA_TAIR 

Ler- 0_VTCl_gDNA_Genbank 

svt2 Kl Col R M3_G1F 

svt2 Kl Col R svt2 R M4 GIF 



2660 2670 

. . . I I I 

GCACTCAATTTTTCTC 
GCACTCAATTTTTCTC 
GCACTCAATTTTTCTC 



2680 



2690 



2700 



I I I I I I I 

TCAACG GTGTATATAAACAAAAGGAG 
TCAACGGTGGGTGTATATAAACAAAAGGAG 
TCAACG GTGTATATAAACAAAAGGAG 
CAACACTCAATTTTTTCTCTTCACCGGGGGGGGGATGTAAAAAAAAAGAG 



Col - 0_VTCl_gDNA_TAIR 

Ler- 0_VTCl_gDNA_Genbank 

svt2 Kl Col R M3_G1F 

svt2 Kl Col R svt2 R M4 GIF 



2710 2720 2730 2740 2750 

I I I I I I I I I I 

TCTCCTTTGGAAAAAACTTGCCTATCATTTTGCCAACGAAC GTTCTTTC 
TCTCCTTTGGAAAAAACTTGCCTATCATTTTGCCAACGAAC GTTCTTTC 
TCTCCTTTGGAAAAAACTTGCCTATCATTTTGCCAACGAAC GTTCTTTC 
TCTCCTTTGGAAAAAAAAAAC TATC TTTTGCCGCC AAC ATTTTTTT 



Col - 0_VTCl_gDNA_TAIR 
Ler- 0_VTCl_gDNA_Genbank 
svt2 Kl Col R M3 GIF 



2760 2770 2780 2790 2800 

I I I I I I I I I I 

TTCTTAATCAC AGCTCAGCCTGACGCAACCGCTCAGGCTGATCTCTTCC 
TTCTTAATCAC AGCTCAGC CTACGCAACCGCTCAGGCTGATCTCTTCC 
TTCTTAATCAC AGCTCAGCCTGACGCAACCGCTCAGGCTGATTGATTCT 



svt2 Kl Col R svt2 R M4 GIF TCTTTATTCA GCTCAGC CTGAGGAACCCCCAGTGGGCATTTTTTAA 



Col-0_VTCl_gDNA_TAIR 
Ler- 0_VTCl_gDNA_Genbank 
svt2 Kl Col R MS GIF 



2810 2820 2830 2840 2850 

I I I I I I I I I I 

AATTTACAGCCATTTCCCAGCTCAGATCTCTGATCCGGTGAGATCTCTCT 
AATTTACAGCCATTTCCCAGCTCAGATCTCTGATCCGGTGAGATCTCTCT 
CTTTTTT TCCCCGAAAAAAAAAAAACAGAAACTGGGGGCAATTTTTTT 



svt2 Kl Col R svt2 R M4 GIF AAAAG= 



Col - 0_VTCl_gDNA_TAIR 

Ler- 0_VTCl_gDNA_Genbank 

svt2 Kl Col R M3_G1F 

svt2 Kl Col R svt2 R M4 GIF 



2860 2870 2880 2890 2900 

I I I I I I I I I I 

CAAGGTAATGCCCCTGCAATTTTGCTTACTTCTCTGGTTGTGATATGCAT 
CAAGGTAATGCCCCTGCAATTTTGCTTACTTCTCTGGTTGTGATATGCAT 
TA - GCACCCCTCCAAAAAGGAAATTTTTTAATCTTCTGTCCAAAACCGT 



2910 2920 2930 2940 2950 

I I I I I I I I I I 

Col - 0_VTCl_gDNA_TAIR GTTCTTCGAATTTTCATCGTTTGTGATTTGAATTCTCATTTTGTATTTGC 
Ler- 0_VTCl_gDNA_Genbank GTTCTTCGAATTTTCATCGTTTGTGATTTGAATTCTCATTTTGTATTT C 

svt2 Kl Col R M3_G1F ATTTTTTTAAGCCCCCCTTT 

svt2 Kl Col R svt2 R M4 GIF 



Figure S2. Sequence alignment of the VTC1 promoter InDel polymorphism sequence of the Col-OTAIR database, the Ler-0 Genbank 
database, the svt2 K1 Col R M3 revertant (Col-like phenotype) and the svt2 K1 Col R svt2 R revertant (sW2-like phenotype). 

Horizontal arrows denote 5' respectively 3^ flanking regions of the sequence insertion, which is highlighted in grey, in the VTC1 promoter 
region in Ler-0 and svt2 K1 Col R svt2 R M4, which exhibits an svt2-like phenotype. The svt2 K1 Col R M3 mutant has a Col-like phenotype 
and share sequence similarities with the Col-0 sequence. R denotes revertant. 
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David Oppenheimer 

Department of Botany, University of Florida, Gainesville, FL, USA 



Not Approved: 31 January 2013 
Referee Report: 31 January 2013 

In this paper, the authors characterize a suppressor of the vtc1-1 mutation, which they named svt2. The 
authors characterize the phenotype of the vtc1-1 mutants that carry the suppressor mutation and show 
that the plants have characteristics reminiscent of the Ler accession. Molecular characterization of the 
suppressed plants show that the suppressor mutation is apparently a reversion of the original vtc1-1 
mutation, and surprisingly, have additional genomic signatures of the Ler ecotype as well as additional 
mutations. 

There are several problems with the experimental methods used in this manuscript. 

First, according to Figure 1 of the manuscript, the authors screened the generation (the mutagenized 

seed) for wt ascorbic acid content. This is a significant problem for the subsequent mutant analysis in this 
manuscript. When Arabidopsis seed are mutagenized, the individual cells of the meristem on the seed are 
mutagenized independently. When the seed germinate, the plants are genetic mosaics. In addition, only 
those mutations in the L2 layer that gives rise to the germ cells will pass on the mutations. Therefore, it is 
highly unlikely that a seedling with wt ascorbic acid levels would be isolated from the M^, because it would 

take the accumulation of many independent mutations, each of which would need to lead to suppression 
of vtcl . 

It is possible that a large sector of an seedling could contain a suppressor mutation that leads to wt 

ascorbic acid levels, but this sector would have to include cells in the L2 layer for the mutation to be 
passed on to the next generation. Also, an Ler-like sector should be obvious on a mostly Col-0 plant. 
Nonetheless, a sector that included the L2 would lead to segregation of the phenotype in the 

generation, because the cells in the sector would be heterozygous for the suppressor mutation. 

Second, the concentration of EMS commonly used for mutagenesis in Arabidopsis (0.2%) is known to 
cause multiple mutations per genome. When one isolates a mutant of interest from an EMS screen, one 
should back-cross it at least once to allow these other mutations to segregate away. Otherwise, one may 
observe unexpected results when analyzing the mutant of interest due to the effects of these other 
mutations. 

Third, when analyzing the sequence of the vtc1 gene in the original mutant and in the suppressor, svt2, 
the authors compared the sequence to the Ler and Col-0 sequences reported in Genbank and TAIR. 
Instead, the authors should sequence the vtc1 gene from their original vtc1-1 stock and the Ler accession 
that is present in their lab. This is because it is known that nucleotide polymorphisms arise regularly in lab 
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Stocks such that a comparison between a lab stock of Col-0 and the reference sequence can show many 
differences. Because the authors are reporting unexpected sequencing results, they should show the 
actual sequence traces (from both strands) for the individual base pair differences highlighted in Figure 
S1 . It would be appropriate to show these sequence traces in the supplemental data. Showing the 
sequencing traces for the base pair differences would demonstrate that the sequence differences are not 
due to ambiguous base calling or other sequencing errors. 

Fourth, the authors refer to the suppressed plants and their revertants has having a Ler-like or Col-like 
phenotypes. Because the phenotype of vtc1 is lower ascorbic acid levels, and the putative suppressor 
has wt ascorbic acid levels, the ascorbic acid levels in the revertants should be measured to show that 
they are revertants, instead of relying on the Ler or Col phenotype. 

Fifth, because seed and pollen contamination can explain the results, the authors need to explicitly state 
the degrees to which they tried to eliminate these possible sources of contamination. Were plants of more 
than 1 genotype grown together? Were seeds of more than 1 genotype collected in the same room? Was 
soil stored where plants were setting seed? etc. 

The single, Ler-like seedling found in the population can be explained as an Ler seed that 

contaminated the Col-0 vtc1-1 seed stock used for the mutagenesis. This can be tested by sowing 
several thousand Col-0 vtc1-1 seed from that seed stock, and screening them for the presence of any Ler 
contaminants. 

The results in Table 4 are the same as one would expect from seed contamination: the svt2 plants (with 
the Ler phenotype) have all Ler markers, and the revertants (with the Col phenotype) have all Col 
markers. The svt2 Col R1 plant highlighted in red shows the expected results if the parent of that plant 

was heterozygous for Col/Ler. Again, the authors should state what extraordinary measures they used to 
eliminate seed and pollen contamination. 

Once these comments are addressed, the other unexpected results can be examined in a new light. 

I have read this submission. I believe that I have an appropriate level of expertise to state that I 
do not consider it to be of an acceptable scientific standard, for reasons outlined above. 

Competing Interests: No competing interests were disclosed. 




Igor Kovalchuk 

University of Lethbridge, Lethbridge, AB, Canada 



Approved: 31 January 2013 
Referee Report: 31 January 2013 

Very unusual story. I am still puzzled how this is all possible. I can assume that original vtcl line had some 
Ler-1 background (may be from backcrosses). In this case it is possible that the seeds you started with for 
mutagenesis are highly heterogeneous and some have Ler genomes still present. Now, such a severe 
case of rearrangements due to combination of EMS and vtcl background is unbelievable. I wonder why 
other plants with even greater instability, such as ddml or msh2, have never had anything like this 
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reported. Maybe they have not looked for it hard enough? It would make sense to get the vtc1 mutant into 
rdr2 or rdr6 background (or both) and see whether this RNA cache plays any role - 1 would expect much 
lower chance of getting those revertants, same with reverse transcription mutants. 

I understand that the event is rare - a single plant was produced - but it would really make the entire story 
stronger if several different plants were produced. 

I have read this submission. I believe that I have an appropriate level of expertise to confirm that 
it is of an acceptable scientific standard. 

Competing Interests: No competing interests were disclosed. 
Andy Pereira 

Departments of Crop, Soil and Environmental Sciences, and Plant Pathology, University of Arkansas, 
Fayetteville, AR, USA 

Approved: 25 January 2013 
Referee Report: 25 January 2013 

Very interesting experimental evidence of an inheritance phenomenon that is non-Mendelian and 
supports an RNA cache hypothesis. The data support the conclusions drawn, but some alternative 
explanations are put forth that can be addressed. 

The EMS mutagenesis treatment of the vtc1-1 genotype yields a single suppressor svt2 mutant that turns 
out to be a revertant of the vtc1-1 mutation, and is homozygous. Since this screen was from -1200 seed, 
it would be interesting to know if such revertant suppressor mutations are also be present in the original 
batch of vtc1-1 seed used for mutagenesis. Of course since the screen entails a tedious test of TTB on 
leaves of individual plants for AA content, it is not a recommended control test that should be done, but 
mechanistically the question remains if the locus is mutable without mutagenesis. What is curious is that 
the phenotype of the suppressor plant shows a Ler 'plant type' phenotype. 

Looking back at the history of the vtc1 mutant, the only time when the Ler and Col genomes were 
together, described in Conklin etal (1996), was when the vtc1 mutant was crossed to Ler for mapping. 
The description of the vtc1-1 (soz1) mutant stock (in TAIR) is given as result of 2 x backcrosses and an 
F3, presumably as a result of crossing to Col-0, but it might be useful to confirm that the stock has no Ler 
background and the seed used was progeny of single plant and not from a bulk seed lot. Since the VTC1 
locus has also been characterized by cyt1 and emb101 mutants, it would seem that the mutant alleles 
might have some disadvantage in being propagated and a 'residual heterozygosity' might persist by some 
mechanism. In addition, reversion to a wild-type phenotype svt2 might be facilitated by a selection of 
vigorous embryos into maturity. 

The sequence changes in the vtc1-1 and svt2 suggests an origin of a 'template' independent of Ler and 
Col alleles, and might also be sequences from another related ecotype. A screen of available Arabidopsis 
ecotype genome sequences should show such an alternate donor. 

Minor comment: On the PDF, page 10, need to use 'were' instead of 'where' in the sentence beginning "In 
all cases..." 
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I have read this submission. I believe that I have an appropriate level of expertise to confirm that 
it is of an acceptable scientific standard. 

Competing Interests: No competing interests were disclosed. 
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